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EDITORIAL 


Persuasive words are not enough 


ommunicating the findings of science plays a vital 
role in shaping our lives and the planet. From my 
vantage point at the Science family of journals, I 
witness daily how editors, journalists, and scien- 
tists work together to deliver scientific informa- 
tion to the world. Given the recent rapid pace of 
discovery, relaying remarkable findings has never 
been busier. So why aren’t these efforts having a bigger 
positive effect on the public acceptance of science? 

It’s baffling that as the world struggles to tame coro- 
navirus disease 2019 (COVID-19), a large portion of the 
population ignores the fact that wearing masks and 
practicing social distancing dampen the spread of the 
pandemic. Even when a vaccine for COVID-19 becomes 
available, the benefits of achieving herd immunity will 
be endangered if growing antivac- 
cine sentiment leads some folks to 
refuse to get vaccinated. American 
science denialism, in particular, 
persists, even at the highest level 
of leadership, with a president 
who denies climate change and a 
vice president—a devout creation- 
ist—who believes that Earth is only 
6000 years old. 

[hear a lot from our readers and 
stakeholders about how to solve 
this problem of science denial. 
Most of them suggest that Science 
and science “need to do a better job 
at telling its stories.” I don’t buy it. For more than a 
century, this journal has been delivering insightful and 
reliable scientific information; today, our articles have 
the highest readership ever. Sure, we can do a better job 
of simplifying messages and making them accessible to 
more people. There is always room for improvement. 
But is this really the crux of this dangerous problem? 

The scientific community is up against a sophisti- 
cated, data-driven machine that is devoted to making 
sure that science doesn’t fully succeed, and the history 
of this is quite clear. A recent Science editorial pointed 
out that U.S. Republican politicians embraced Earth 
Day when it started 50 years ago. But in the 1980s, digi- 
tal analysis of political polling data based on location 
fostered the formation of the anti-science movement in 
the United States—politicians and their supporters who 
did not like the results criticized the findings and the 
process. It became fashionable and politically expedi- 
ent to run against science. Any carve-outs for the en- 
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community is 
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vironment in the conservative government went away 
in favor of the dismantling of regulations grounded in 
evidence. Over time, digital technologies have become 
more sophisticated, and now there is a massive, churn- 
ing, finely tuned digital misinformation machine that 
has seized social media to ensure that a portion of the 
population doesn’t accept science. And this battle be- 
tween science fact and fiction isn’t just being waged in 
the United States—the United Kingdom, Russia, India, 
and Brazil all face a similar predicament. 

The current implications of this battle in the United 
States are everywhere. The administration has promul- 
gated the idea that severe acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2, the cause of COVID-19) was 
engineered in China’s Wuhan Institute of Virology, in 
part based on a non-peer-reviewed 
preprint that was later retracted. 
The misinformation about masks 
and social distancing is spurring 
dangerous bar gatherings and choir 
practices. Unsubstantiated claims 
in a “plandemic” video are convinc- 
ing citizens that Dr. Anthony Fauci, 
the longtime leader of the U.S. Na- 
tional Institute of Allergy and In- 
fectious Diseases, is hiding a secret 
business deal from which he stands 
to profit from COVID-19. The anti- 
science movement started with the 
environment, which could hurt our 
long-term survival, but in the era of COVID-19, it threat- 
ens our immediate survival. 

The scientific community is losing the battle against 
this digital leviathan of misinformation. A well-reasoned 
and highly placed op-ed on this topic is not going to move 
the needle, no matter how well it is crafted to adhere to 
the best practices in science communication. Neither is 
a perfect trade book, television appearance, or speaking 
tour by a scientific leader. The only way to win this fight 
is to harness the same sophisticated tools in the name of 
science that are being used to tear science down. With 
social media companies afraid to challenge the misin- 
formation machine, even when their own platforms are 
being misused, the task is daunting. But we can at least 
move on from the idea that if we could just find those 
perfect, persuasive words, everyone would suddenly real- 
ize that facts are facts with no alternatives. 


-H. Holden Thorp 
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Editor-in-Chief, 
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Mask study draws fire 


PUBLIC HEALTH | Dozens of epidemio- 
logists are calling for the retraction of a 
paper claiming that face masks are “the 
most effective means” to prevent transmis- 
sion of the virus that causes COVID-19. 

In the study, published on 11 June in the 
Proceedings of the National Academy of 
Sciences, Nobel laureate and atmospheric 
chemist Mario Molina of the University 

of California, San Diego, and colleagues 
analyzed COVID-19 case counts and mask 
policies in areas with large outbreaks, 
concluding that social distancing and other 
measures to control coronavirus transmis- 
sion were “insufficient by themselves.” 
Last week, more than 40 scientists signed 
a letter to the journal acknowledging the 
importance of masks but calling the study 
flawed. For example, they argue it didn’t 
properly account for an “enormous set of 
changes across society” that could have 
affected COVID-19 incidence, indepen- 
dent of mask use. The scientists also ask 
that the journal reassess its long-standing 
policy of allowing U.S. National Academy 
members such as Molina to select their 
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own reviewers for some submissions. 
(Neither of the paper’s listed reviewers is 
an epidemiologist.) Molina has defended 
the paper, saying the critics failed to under- 
stand the role of aerosol particles in viral 
transmission and simply distrust work by 
scientists outside their field. 


Antimalarial drug trials end 


CLINICAL RESEARCH | The World Health 
Organization (WHO) and U.S. National 
Institutes of Health (NIH) decided sepa- 
rately last week to stop large clinical trials 
evaluating whether hydroxychloroquine 
can safely and effectively treat people 
hospitalized with COVID-19, an idea touted 
by U.S. President Donald Trump early 

in the pandemic. In an interim analysis 
announced on 20 June, NIH’s trial data 
and safety monitoring board said the anti- 
malarial drug was safe but “very unlikely to 
be beneficial.” WHO cited similar grounds 
on 17 June in permanently halting the 
hydroxychloroquine arm of its Solidarity 
trial. Both decisions came after investi- 
gators running the U.K. Recovery trial 
announced on 5 June they had stopped 
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People exercise on 15 June behind plastic sheets 

at a gym in Redondo Beach, California, that reopened 
under state guidelines that call for continuing 

social distancing. Public health experts worry the 
reopenings are increasing COVID-19 infections 

in many U.S. states. 


giving hydroxychloroquine to patients 
because the data showed no benefit. That 
study, the largest investigating drugs to 
treat COVID-19, continues. 


Steroid shortage feared 


DRUG THERAPY | The World Health 
Organization (WHO) has called for 
increased production of dexamethasone, 

a commonly used corticosteroid recently 
shown to reduce death rates of COVID-19. 
“Hoarding and speculative procurement 
appear to have already started,” says Emer 
Cooke, WHO’s head of regulation of medi- 
cines and other health technologies. But 
she says it’s “probably too early to say if 
there will be a global shortage.” The great- 
est concern is for the injectable version of 
the drug, which some physicians say is the 
preferred formulation, especially for criti- 
cally ill patients. But it is more complicated 
to produce than oral dexamethasone, and a 
major Indian manufacturer has faced regu- 
latory scrutiny for production problems. 
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Another step toward a new megacollider 


ERN, the European particle physics laboratory near Geneva, 

announced last week it will study the technical and financial 

feasibility of an atom smasher 80 to 100 kilometers in cir- 

cumference. To be built in the 2040s, the megacollider would 

succeed the lab’s 27-kilometer Large Hadron Collider (LHC), 

smashing electrons into positrons to study in detail the Higgs 
boson, the famous particle the LHC discovered in 2012. But the 23 na- 
tions that fund CERN have made no commitment to build the mam- 
moth new collider, for which researchers produced a rough design 
18 months ago, says CERN Director-General Fabiola Gianotti. The fea- 
sibility study should be done by 2026 or 2027. The electron-positron 
collider could open the way to later building an even more powerful 
proton smasher like the LHC in the same tunnel. 


NSF director starts work 


LEADERSHIP | Sethuraman “Panch” 
Panchanathan began his 6-year term this 
week as head of the National Science 
Foundation (NSF). The 59-year-old com- 
puter scientist, who had been a senior 
administrator at Arizona State University, 
Tempe, was confirmed by the U.S. Senate 
on 18 June, 6 months after President 
Donald Trump nominated him to be the 
agency’s 15th director. Kelvin Droegemeier, 
Trump’s science adviser, had been double 
hatting as acting NSF director since France 
Cordova completed her term on 31 March. 


Critics ask to strip Watson name 


RACIAL JUSTICE | Spurred by the recent 
groundswell of the Black Lives Matter 
movement, scores of students and alumni 
of a prestigious doctoral program are ask- 
ing its trustees to remove Nobel laureate 
James Watson’s name from the program, 
called the Watson School of Biological 
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Temperature on 20 June in 
the town of Verkhoyansk, 
Russia, the highest ever recorded 
north of the Arctic Circle. 
The average June high there 
is just 20°C. Unusually 
warm temperatures in Siberia 
this year have melted 
permafrost and fed wildfires. 
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Sciences. The request from 133 students 
and alumni of the Cold Spring Harbor 
Laboratory (CSHL) program, in a letter 
sent 21 June, says Watson has made rac- 

ist comments since at least 2007, when a 
newspaper quoted him saying Black people 
were of inferior intelligence, leading CSHL 
to remove him as chancellor. In 2019, after 
he told PBS his views had not changed, the 
lab stripped his remaining honors except 
for the school’s name. Watson’s name, 

the letter states, “is now inextricably linked 
with racism.” It adds: “This name change 
should be the first step in a larger process 
to make [CSHL] an inclusive and support- 
ive environment, especially for people of 
color” A spokesperson for Marilyn Simons, 
chair of CSHL’s Board of Trustees, wrote in 
an email that changing the school’s name 
requires 75% of the board to vote to amend 
CSHLs charter; a meeting is scheduled 

for early July. Watson, 92, who co-discovered 
DNAs structure and was CSHL’s director 
and president for decades, could not be 
reached for comment. 
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Trump suspends high-tech visas 


IMMIGRATION | President Donald 
Trump’s new executive order limiting 
the entry of foreign workers through the 
end of the year spares two visa categories 
important to U.S. universities but halts 
new visas in a third category. The 

22 June executive order, which Trump 
says will protect U.S. jobs, does not affect 
the Optimal Practical Training program 
that allows graduating international 
students to stay in the country for up to 
3 years. It also exempts visiting research- 
ers and scholars coming under the 

J-1 visa program. But the order does 
block the issuance of any new so-called 
H-1B visas, a category that both universi- 
ties and high-tech companies use to 
attract skilled foreign workers. None of 
the new restrictions applies to visa 
holders already in the country. Google 
CEO Sundar Pichai tweeted that he’s 
“disappointed” by the order and that 

the company “will continue to stand 
with immigrants.” 


X-rays reveal black hole bonanza 


ASTRONOMY | An orbiting x-ray tele- 
scope launched last year has released its 
first all-sky image, recording more than 
1 million objects, twice as many sources 
as previously detected in 60 years of 
x-ray astronomy. Most of the dots in the 
image recorded by eROSITA (extended 
Roentgen Survey with an Imaging 
Telescope Array) are supermassive black 
holes at the centers of galaxies, where 
they gorge on gas heated so hot that it 
can be seen across the universe. The 
project’s 4-year goal is to map millions of 
such galaxies to understand how gravity 
gloms them together in clusters and 
how the mysterious dark energy pushes 
against gravity to accelerate expansion 
of the universe. 


The x-ray universe, based on 
6 months of data from 
the eROSITA 
satellite. 
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Pulses of ultrasound aimed at a motion-sensitive brain area (from brown device, right) improved people’s ability to judge dots’ direction of motion. 


Hope grows for targeting the brain with ultrasound 


Noninvasive method of tweaking neural activity enters new tests in humans 


By Kelly Servick 


s away to see inside the body, revealing 
a tumor or a fetus, ultrasound is tried 
and true. But neuroscientists have a 
newer ambition for the technology: 
tinkering with the brain. At frequen- 
cies lower than those of a sonogram 
but still beyond the range of human hearing, 
ultrasound can penetrate the skull and boost 
or suppress brain activity. If researchers can 
prove that ultrasound safely and predictably 
changes human brain function, it could be- 
come a powerful, noninvasive research tool 
and a new means of treating brain disorders. 
How ultrasound works on the brain re- 
mains mysterious. But recent experiments 
have offered reassurance about safety, and 
small studies hint at meaningful effects in 
humans—dampening pain, for example, or 
subtly enhancing perception. “I’ve seen a 
lot of tantalizing data,’ says Mark Cohen, a 
neuroscientist at the University of California, 
Los Angeles (UCLA). “While the challenges 
are very large, the potential of this thing is so 
much larger that we really have to pursue it.” 
Scientists can already modulate the brain 
noninvasively by delivering electric current 
or magnetic pulses across the skull. The U.S. 
Food and Drug Administration (FDA) has 
approved transcranial magnetic stimulation 
(TMS) to treat depression, migraine pain, 
and obsessive-compulsive disorder (OCD). 
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But unlike magnetic or electric fields, 
sound waves can be focused—like light 
through a magnifying glass—on a point 
deep in the brain without affecting shal- 
lower tissue. For now, that combination 
of depth and focus is possible only with a 
surgically implanted wire. But ultrasound 
could temporarily disrupt a deep human 
brain region—the almond-shaped amyg- 
dala, a driver of emotional responses, for 
example, or the thalamus, a relay station for 
pain and regulator of alertness—to test its 
function or treat disease. 

Results in animals are encouraging. Ex- 
periments in the 1950s first showed ultra- 
sound waves could suppress neural activity 
in a visual region of the cat brain. In rodents, 
aiming ultrasound at motor regions has trig- 
gered movements such as a twitch of a paw 
or whisker. And focusing it on a frontal re- 
gion of monkey brains can change how the 
animals perform at eye movement tasks. 

But it’s technically tricky to aim ultra- 
sound through thick, dense skull bone and 
to show its energy has landed at the in- 
tended point. And ultrasound’s effects on 
the brain can be hard to predict. How much 
it boosts or suppresses neural activity de- 
pends on many parameters, including the 
timing and intensity of ultrasound pulses, 
and even characteristics of the targeted 
neurons themselves. “I have tremendous 
excitement about the potential,” says Sarah 
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Hollingsworth Lisanby, a psychiatrist at the 
National Institute of Mental Health who 
studies noninvasive neuromodulation. “We 
also need to acknowledge that there’s a lot 
we have to learn,” she says. 

For one thing, researchers are largely in 
the dark about how sound waves and brain 
cells interact. “That’s the million-dollar ques- 
tion in this field,” says Mikhail Shapiro, a bio- 
chemical engineer at the California Institute 
of Technology. At high intensities, ultrasound 
can heat up and kill brain cells—a feature 
neurosurgeons have exploited to burn away 
sections of brain responsible for tremors. 

Even at intensities that don’t significantly 
increase temperature, ultrasound exerts a 
mechanical force on cells. Some studies sug- 
gest this force alters ion channels on neurons, 
changing the cells’ likelihood of firing a signal 
to neighbors. If ultrasound works primarily 
via ion channels, “That’s great news,” Sha- 
piro says, “because that means we can look 
at where those channels are expressed and 
make some predictions about what cell types 
will be excited.” In a preprint on bioRxiv last 
month, Shapiro’s team reported that expos- 
ing mouse neurons in a dish to ultrasound 
opens a particular set of calcium ion chan- 
nels to render certain cells more excitable. 

But these channels alone won’t explain 
ultrasound’s effects, says Seung-Schik Yoo, 
a neuroscientist at Harvard University. He 
notes that ultrasound also appears to affect 
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receptors on nonneuronal brain cells called 
glia. “It’s very hard to [develop] any unifying 
theory about the exact mechanism” of ultra- 
sound, he says. 

Regardless of mechanism, ultrasound is 
starting to show clear, if subtle, effects in hu- 
mans. In 2014, a team at Virginia Polytech- 
nic Institute and State University showed 
focused ultrasound could increase electrical 
activity in a sensory processing region of 
the human brain and improve participants’ 
ability to discern the number of points be- 
ing touched on their fingers. Neurologist 
Christopher Butler at the University of Ox- 
ford and colleagues have tested ultrasound 
during a more complex sensory task: judg- 
ing the motion of drifting, jiggling dots on a 
screen. Last month at the Cognitive Neuro- 
science Society’s annual meeting online, he 
reported that stimulating a motion-process- 
ing visual region called MT improved sub- 
jects’ ability to judge which way the majority 
of the dots drifted. 

Ultrasound’s effects have so far been sub- 
tler than those of TMS, says Mark George, a 
psychiatrist at the Medical Uni- 
versity of South Carolina, who 
helped develop and refine that 
technology. With TMS, “you 
put it on your head and turn 
it on and your thumb moves,’ 
he says. But the ultrasound ex- 
periments that prompted paw 
twitches in mice used intensi- 
ties “so, so, so much higher than 
what we're being allowed to use 
in humans.” 

Regulators have limited hu- 
man studies in part because 
ultrasound has the potential to 
cook the brain or cause damage 
through cavitation—the cre- 
ation of tiny bubbles in tissue. In 
2015, Yoo and colleagues found microbleeds, 
a sign of blood vessel damage, in sheep brains 
repeatedly exposed to ultrasound. “This was 
a huge speed bump,” says Kim Butts Pauly, 
a biophysicist at Stanford University. But in 
February in Brain Stimulation, her group 
reported microbleeds in control animals as 
well, suggesting this damage might result 
from dissection of the brains. Butts Pauly and 
Yoo now say they’re confident the technology 
can be used safely. 

Cohen and collaborators recently tested 
safety in people by aiming ultrasound at 
regions slated for surgical removal to treat 
epilepsy. With FDA’s OK, they used intensi- 
ties up to eight times as high as the limit for 
diagnostic ultrasound. As they reported ina 
preprint on medRxiv in April, they found no 
significant damage to brain tissue or blood 
vessels. However, to find the limit of safety, 
researchers will likely need to go all the way 
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challenges are 
very large, the 

potential of 
this thing is so 

much larger 
that we really 


have to pursue it.” 
Mark Cohen, UCLA 


to levels that damage tissue, Cohen says. 

Several teams are cautiously moving into 
tests of ultrasound as treatment. In 2016, 
UCLA neuroscientist Martin Monti and col- 
leagues reported that a man in a minimally 
conscious state regained consciousness fol- 
lowing ultrasound stimulation of his thala- 
mus. Monti is preparing a publication on a 
follow-up study of three people with chroni- 
cally impaired states of consciousness. After 
ultrasound, they showed increased respon- 
siveness over a period of days—much faster 
than expected, Monti says, although the 
study included no control group. 

That research and the tests in epilepsy pa- 
tients used an ultrasound device developed by 
BrainSonix Corporation. Its founder, UCLA 
neuropsychiatrist Alexander _ Bystritsky, 
hopes ultrasound can disrupt neural cir- 
cuits that drive symptoms of OCD. A team at 
Massachusetts General Hospital and Baylor 
College of Medicine is planning a study in 
humans using the BrainSonix device, he says. 

Columbia University biomedical engineer 
Elisa Konofagou hopes to use ultrasound to 
treat Alzheimer’s disease. Be- 
fore COVID-19 interrupted par- 
ticipant recruitment, she and 
colleagues were preparing a pi- 
lot study to inject tiny gas-filled 
bubbles into the bloodstream 
of six people with Alzheimer’s 
and use pulses of ultrasound 
to oscillate the microbubbles in 
blood vessels lining the brain. 
The mechanical force of those 
vibrations can temporarily 
pull apart the cells lining these 
vessels. The researchers hope 
opening this blood-brain bar- 
rier will help the brain clear 
toxic proteins. (Konofagou’s 
team and others are also ex- 
ploring this ultrasound-microbubble combi- 
nation to deliver drugs to the brain.) 

In his first test of ultrasound after years 
of studying TMS, George looked to reduce 
pain. His team applied increasing heat to 
the arms of 19 participants, who tended to 
become more sensitive over repeated tests, 
reporting pain at lower temperatures by the 
last test. But if, between the first and last 
test, they had pulses of ultrasound aimed at 
the thalamus, their pain threshold dipped 
half as much. “This is definitely a double 
green light” to keep pursuing the technol- 
ogy, George says. 

George regularly treats depressed pa- 
tients with TMS and has seen the technol- 
ogy save lives. “But everybody wonders if we 
could go deep with a different technology— 
that would be a game changer,’ he says. 
“Ultrasound holds that promise, but the 
question is can it really deliver?” 
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Lawsuit alleges 
scientific 
misconduct at 
weapons lab 


Dispute underscores 
the challenges 

of using simulations 
to “test” warheads 


By Adrian Cho 


n unusual lawsuit alleges scientific 

misconduct at Lawrence Livermore 

National Laboratory in California, 

one of the United States’s three nu- 

clear weapons labs. Peter Williams, 

a 50-year-old physicist, worked at 
Livermore from January 2016 until May 
2017, when he says he was fired in retali- 
ation for complaining that his superiors 
were mishandling a computer program that 
simulates the detonation of high explosives, 
undermining their ability to predict how a 
particular nuclear weapon would perform if 
used. Williams, who now works at a private 
research lab, has sued Livermore and seven 
individuals for reinstatement and $600,000 
in damages. 

Researchers familiar with the labs say 
Williams’s allegations should be taken se- 
riously. “If there’s been a cover-up, that’s 
something that ought to be looked into,’ 
says Raymond Jeanloz, a geophysicist at 
the University of California, Berkeley, who 
has been involved with the weapons labs. 
But he also says the labs implement inter- 
nal reviews and other measures to ensure 
the integrity of their work and head off the 
kind of problem Williams alleges. “This is 
exactly the kind of thing the people at the 
lab worry about,’ Jeanloz says. Livermore 
declined to comment on the suit, but in a 
statement said: “Rigorous debate is a part 
of the scientific process—the Laboratory 
does not retaliate against individuals for 
holding differing opinions.” 

The suit, which Williams filed on 22 May, 
seems quixotic. He is representing himself; 
to make his case, he needs documents that 
only the lab can provide; and his complaint 
centers on a differential equation. Williams 
spent only a short time at Livermore before 
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he was fired. (In a 12-month performance 
review Williams included in his suit, his 
superiors state he wasn’t keeping up with 
assignments.) Before joining the lab he did 
two postdocs, taught at City College of San 
Francisco and Sonoma State University, and 
worked for 8 years at Agilent Technologies. 
But Williams is a talented scientist, says 
Craig Wheeler, an astrophysicist at the Uni- 
versity of Texas, Austin, who was his gradu- 
ate adviser and has subsequently published 
with him. “He’s a deep, inde- 
pendent thinker,’ Wheeler says. 
“He’s definitely not a crackpot.” 
At Livermore, Williams was 
given the unclassified task of 
modeling the behavior of a high 
explosive. Called PBX 9502, the 
polymer-bonded explosive is in- 
tended to be used in refurbish- 
ing a 40-year-old thermonuclear 
warhead called the W80, which 
is about the size of a garbage 
can and fits on a cruise missile. 
The explosive, when detonated, 
would compress the warhead’s 
plutonium pit to set off a nuclear 
fission explosion, which in turn 
would trigger an even more powerful fusion 
explosion. Modeling a high explosive is dif- 
ficult, researchers say, because its behavior 
depends on its density, which can vary from 
batch to batch and within a single piece. 
But such modeling is crucial to be sure 
that stored weapons will work as intended 
because the United States gave up nuclear 
testing in 1992 under the still unratified 
Comprehensive Nuclear-Test-Ban Treaty. 
PBX 9502 is especially tricky because it’s 
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“The motive 
was to keep 
money flowing 
to help develop 
and improve 
CHEETAH.” 


Peter Williams, 
formerly of Lawrence 
Livermore National 
Laboratory 
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Peter Williams claims tweaks to a program for modeling a bomb’s explosive trigger make its predictions unreliable. 


an insensitive explosive, meaning it won’t 
go up if smacked or set on fire. But that 
also means that as the detonation front 
moves through the material, the chemical 
reactions behind it progress more slowly 
than in older explosives, Williams says. So 
instead of assuming the reactions are in- 
stantaneous, researchers must model their 
evolution, too. 

To make sure they can do so correctly, re- 
searchers detonate samples of the stuff in 
different geometries, observe 
the explosions with ultra-high- 
speed cameras and other tools, 
and compare the results with 
simulations. For example, in 
one test, the sample resembles 
a mallet, with a thin cylindrical 
handle that leads to a fatter cy- 
lindrical head. Because the det- 
onation front cannot instantly 
turn the corner where skinny 
cylinder meets fat, the explo- 
sion doesn’t consume the entire 
sample, but leaves a ring of ma- 
terial whose thickness reveals 
the speed of the detonation 
front and chemical reactions. 

Livermore used such test explosions to 
refine a modeling program called ARES- 
CHEETAH. But Williams says he became 
aware that one of his supervisors was essen- 
tially changing parameters in the program 
after the fact to make sure the simulation 
fit the data from each experimental setup. 
That’s “pseudoscience,’ Williams charges, as 
it guarantees the model will look accurate, 
even if it isn’t. He says he repeatedly asked 
the researcher to explain the rationale for the 


Published by AAAS 


tweaking. “It became clear that he didn’t have 
a rationale, he had a motive,’ Williams says. 
“And the motive was to keep money flowing 
to help develop and improve CHEETAH.” 

Such “curve fitting” renders the program 
incapable of making meaningful predic- 
tions, he says. If CHEETAH can’t properly 
model the bench tests of PBX 9502, larger, 
far more expensive integrated test explo- 
sions of the weapon’s nonnuclear compo- 
nents could fail, he says. And modelers may 
not be able to correctly calculate the yield of 
the refurbished weapon, he says. 

That would be a big problem, says 
Robert Rosner, a theoretical astrophysicist 
at the University of Chicago and former di- 
rector of Argonne National Laboratory who 
is on Livermore’s advisory board. The un- 
certainty in the yields of various weapons 
directly affects calculations of how big the 
U.S. nuclear arsenal should be and could 
determine the course of future arms con- 
trol negotiations, Rosner says. “The extent 
to which we can decrease [the stockpile] is 
completely related to the ability to be cer- 
tain about the yield.” 

But Rosner says he’s neither surprised 
nor alarmed to hear researchers had to 
adjust the high-explosives models to fit 
different experiments. What Williams sees 
as unacceptable tweaking is an inevitable 
part of the empirical modeling of any com- 
plex system, such as the climate or a star, 
Rosner says. “The idea that the model has 
to be tuned, I find that unsurprising,” he 
says. “I would be stunned if it didn’t.” 

Both Rosner and Jeanloz say they suspect 
Williams may not know the whole story 
behind ARES-CHEETAH, as the lab dis- 
missed him before he obtained a clearance 
to work on classified material. The weapons 
program also takes explicit measures to en- 
sure that researchers adhere to scientific 
standards, Jeanloz says, such as running its 
own classified peer-reviewed journals and 
encouraging researchers to keep a hand in 
unclassified research. The rivalry between 
Livermore and Los Alamos National Labo- 
ratory in New Mexico provides a particu- 
larly valuable check, scientists say. 

Yet Jeanloz and Rosner agree that mod- 
els like CHEETAH need careful scrutiny. 
That’s because some policymakers may view 
uncertainties in modeling as a reason to re- 
sume nuclear testing. In fact, Jeanloz says, 
some arms control experts argue that to dis- 
courage calls for new tests, the United States 
should make no changes to its existing nu- 
clear arms—even ones that make them safer, 
such as switching to insensitive explosives. 

Williams does not yet have a court date. 
He’s suing, he says, not out of anger, but out 
of a sense of scientific duty: “I couldn’t look 
myself in the mirror if I didn’t do it.” 
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ANIMAL DOMESTICATION 


Dawn of the chicken revealed in Southeast Asia 


Chicken domestication—debated since Charles Darwin—tracked by genomic survey 


By Andrew Lawler 


t is the world’s most common farm ani- 

mal as well as humanity’s largest single 

source of animal protein. Some 24 billion 

strong, it outnumbers all other birds by 

an order of magnitude. Yet for 2 centu- 

ries, biologists have struggled to explain 
how the chicken became the chicken. 

Now, the first extensive study of the 
bird’s full genome concludes that people in 
northern Southeast Asia or southern 
China domesticated a colorful pheas- 
ant sometime after about 7500 B.C.E. 
Migrants and traders then carried the 
bird across Asia and on to every conti- 
nent except Antarctica. 

“Our results contradict previous 
claims that chickens were domesti- 
cated in northern China and the Indus 
Valley,’ researchers led by Ming-Shan 
Wang from the Chinese Academy of 
Sciences’s Kunming Institute of Zoo- 
logy write in a paper published today 
in Cell Research. They also found that 
the modern chicken’s chief ancestor is 
a subspecies of red jungle fowl named 
Gallus gallus spadiceus. 

“This is obviously a landmark study,’ 
says Dorian Fuller, an archaeologist at 
University College London who was not 
involved in the effort. He adds that the 
results could shed light on the emer- 
gence of agriculture and early trade 
networks, and what features of the bird 
made it so attractive to people. 

Charles Darwin argued the chicken 
descended from the red jungle fowl be- 
cause the birds resemble each other and 
can make fertile offspring; he specu- 
lated that domestication happened in 
India. But five varieties of the pheasant 
inhabit a broad arc extending from the 
jungles of Indonesia to the Himalayan 
foothills of Pakistan. Which variety led 
to the chicken, and where, was uncertain. 
Based on presumed chicken bones, archaeo- 
logists claimed, variously, that people do- 
mesticated the bird 9000 years ago in north- 
ern China and 4000 years ago in Pakistan. 

DNA studies promised to resolve the is- 
sue, but researchers had few samples from 
the bird’s wild relatives. So Jianlin Han, 
a geneticist at the Joint Laboratory on 
Livestock and Forage Genetic Resources, 
embarked on a 20-year project to sample 
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indigenous village chickens and wild jungle 
fowl near more than 120 villages across 
Asia and Africa (Science, 23 November 2012, 
pp. 1020 and 1022). 

Wang's team sequenced the full genomes 
of 863 birds and compared them. The results 
suggest modern chickens descend primarily 
from domesticated and wild varieties in what 
is now Myanmar, Laos, Thailand, and south- 
ern China (see map, below). “This region is a 
center of domestication,” says co-author and 


Early bird 


One subspecies of the red jungle fowl (Gallus gallus spadiceus, 
bottom), found in northern Southeast Asia, likely led to the first 
domesticated chickens, genomes of wild and tame birds show. 
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geneticist Olivier Hanotte of the University of 
Nottingham. The results confirm a hypothesis 
put forward in 1994 by Japan’s Crown Prince 
Akishino, an ornithologist, on the basis of mi- 
tochondrial DNA data. 

Wang’s team did find some evidence for 
a South Asian contribution: A jungle fowl 
native to the Indian subcontinent may have 
interbred with the chicken after its initial do- 
mestication in Southeast Asia, the team says. 

The new DNA data link domesticated 
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chickens most closely to the Southeast 
Asian subspecies G. g. spadiceus, however. 
They suggest the lineage that became the 
modern chicken branched off from the jun- 
gle fowl between 12,800 and 6200 years ago, 
with domestication occurring sometime af- 
ter the lineages split. 

Fuller doubts the bird was fully domes- 
ticated before the arrival of rice and millet 
farming in northern Southeast Asia about 
4500 years ago. Hanotte acknowledges that 
“we need the help of archaeologists” to 
understand the human events that trig- 
gered domestication. 

But Jonathan Kenoyer, an archaeo- 
logist and Indus expert at the Univer- 
sity of Wisconsin, Madison, remains 
skeptical that the chicken arose in 
Southeast Asia. “They need to get an- 
cient DNA’ to back up their claims, 
he says, because genomes of modern 
birds may provide limited clues to early 
events in chicken evolution. 

Nor does the DNA show what first 
enticed people to tame the bird. Early 
varieties were far scrawnier and pro- 
duced fewer eggs than today’s indus- 
trial varieties, and their predators were 
legion. Some researchers suggest the 
bird was initially prized for its exotic 
plumage or for cockfighting. Selling 
prize fighting cocks remains a lucra- 
tive business in Southeast Asia, and 
the birds’ high value may have spurred 
traders to carry them long distances. 

Smithsonian Institution archaeo- 
zoologist Melinda Zeder calls the new 
paper “fascinating” and says it shows 
“the domestication and dispersal story 
is more complicated than we thought.” 
She urges combining genetic and 
archaeological data to flesh out the 
tale. Archaeologists are now gather- 
ing chicken bones that suggest farmers 
in southern China and Southeast Asia 
first domesticated the bird some 3500 years 
ago—findings that bolster the genetic work. 

Han’s group, meanwhile, is creating a mas- 
sive data set based on more than 1500 modern 
chicken genomes from Asia, Europe, and Af- 
rica. The researchers plan to analyze chicken 
dispersal into Europe and Africa, as well as 
the genetic variations behind traits such as 
the ability to withstand disease or produce 
more eggs. “This study opens a whole new 
page in chicken genomics,” Han says. & 
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COVID-19 


Charities that fund research 
face deep revenue declines 


Cancellation of major fundraisers forces groups to lay 
off staff and delay, trim, or cancel grants to scientists 


By Eli Cahan 


arly this year, University of Colorado, 
Denver, cancer researcher Patricia 
Ernst was thrilled when her postdoc 
Therese Vu won a grant from the Leu- 
kemia & Lymphoma Society, a non- 
profit that has pumped more than 
$1.2 billion into blood cancer research since 
its founding in 1949. The funding would al- 
low the scientists to launch studies using a 
technique to generate malignant leukemia 
from immature blood cells—an approach 
that Ernst had been eager to try for more 
than a decade. Then, last month, the pair got 
bad news: The philanthropic group canceled 
the grant, citing “unprecedented” revenue 
losses caused by the COVID-19 pandemic. 

“J did anticipate there would be cut- 
backs,” Ernst says. “But I didn’t think it 
would be that serious, and I didn’t think it 
would happen to us.” 

Many researchers are having similar 
experiences. Foundations that fund bio- 
medical research in the United States, the 
United Kingdom, and elsewhere are report- 
ing record revenue drops because of the pan- 
demic. One major factor: It has forced them 
to cancel key fundraising events, includ- 
ing galas, walks, Broadway partnerships, 
and even an event that sends thousands 
of U.S. firefighters into the streets, ask- 
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ing passersby to support medical research 
by dropping donations into a rubber boot. 
Many groups are trying to stem the losses 
by cutting staff and delaying, trimming, or 
outright canceling grants to researchers. 
The chaos imperils a small, but pivotal, 
part of the scientific ecosystem. Although 
nonprofits provide just 5% of overall U.S. 
research funding, they often support small, 
high-risk pilot studies that later en- 


A fundraiser in which firefighters collect donations for 
muscular dystrophy research was canceled this year. 


men, the largest nonprofit funder of breast 
cancer research, has laid off about 20% of 
its 211 employees, closed 30 of its 61 local af- 
filiates, and tabled future grant cycles. The 
Muscular Dystrophy Association, which 
counts on its annual “Fill the Boot” fund- 
raiser with firefighters for 25% of its annual 
revenue, has furloughed many of its devel- 
opment staff and canceled plans to award 
new grants. At the $724 million Ameri- 
can Cancer Society (ACS), a $200 million 
drop in revenue has prompted it to lay off 
1000 of its some 4300 employees. “If cur- 
rent trends continue,” chief medical officer 
William Cance says, ACS could temporarily 
cut research funding by half. 

In the United Kingdom, the Association 
of Medical Research Charities (AMRC), 
whose members last year sent £1.9 bil- 
lion to biomedical researchers (compared 
with £1.8 billion in funding provided 
by the U.K. government), is reporting a 
38% drop in fundraising revenue. That 
has forced groups to withdraw or defer 
grants, says AMRC CEO Aisling Burnand. 
Cancer Research UK, which funds half of 
the country’s noncommercial cancer re- 
search, has cut its funding by about 10%, or 
£44 million, says CEO Michelle Mitchell. 
The cuts will get deeper if charities do not 
receive more support from the government, 
she adds. And the crisis could have long- 
lasting ripple effects on the next generation 
of research. “We’re in danger of destroying 
a decade’s worth of work, infrastructure, 
and future talent,’ Mitchell says. 

Vu, for example, had hoped to have gath- 
ered enough data from her pilot study by 

October to apply for a grant from 


able researchers to attract larger Se the National Institutes of Health 
grants from government funders. SerenCes (NIH). Now, even if she can find 
And many of the grants go to young oe replacement funding, she thinks 
researchers, helping them launch is supported it will be an additional 12 to 
their careers. “If you're in a room by the 18 months before she can apply to 


with researchers of vascular disease, 
almost all of them will say their first 
grant came from [us], says Mariell Jessup, 
chief science and medical officer at the 
American Heart Association (AHA). 

So far, Jessup says, AHA has been fortu- 
nate: Although donations have dropped, the 
$890 million organization hasn’t had to lay 
people off or rescind grants—but it has post- 
poned awarding a new round of grants. 

The red ink is drowning other US. 
groups. At the National Multiple Sclero- 
sis Society, which last year spent about 
$40 million of its $190 million budget on 
research, officials forecast a $60 million 
shortfall in 2020; they’ve given 78 of their 
198 grantees a 15% “haircut.” Susan G. Ko- 
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Pulitzer Center. 


NIH. And because the leukemia re- 
searcher is from Australia, a fund- 
ing cutoff could imperil her U.S. work visa. 
“T don’t want to be all ‘woe is me, but ... the 
junior people have gotten hammered” by 
the disruption, Vu says. 

The loss of nonprofit grants could also 
hurt researchers seeking funding for high- 
risk ideas that can’t get support from gov- 
ernment funders, says Maryrose Franko, 
CEO of the Health Research Alliance, which 
represents 85 nonprofit research funders. 
“We derisk research for the government, and 
we embrace failure,” Franko says. “If we're 
not funding it, who will?” 


With reporting by Cathleen O'Grady. 
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COVID-19 


Could a blood ‘observatory’ stop pandemics? 


Proposal calls for screening blood from around the world for thousands of antibodies 


By Robert Bazell 


ichael Mina is out for blood— 

millions of samples, which a na- 

scent effort dubbed the Global 

Immunological Observatory (GIO) 

would monitor for signs of patho- 

gens spreading through the popu- 
lation. Instead of a telescope, it will rely on 
technology that can measure hundreds of 
thousands of distinct antibodies in a micro- 
liter of blood. If the GIO can overcome tech- 
nical and logistical hurdles and find sus- 
tained funding, he says, it could provide a 
powerful tool for monitoring and respond- 
ing to disease outbreaks. 

For now, the idea is just a pi- 
lot project to track the spread of 
COVID-19. The stealthy spread of 
that disease through the popula- 
tion underscored the need for such 
a monitoring system, says Mina, an 
immunologist and epidemiologist at 
Brigham and Women’s Hospital and 
the Harvard T.H. Chan School of 
Public Health, who with colleagues 
outlines the GIO concept this month 
in eLife. (The co-authors include 
Jeremy Farrar, an infectious dis- 
ease specialist and director of the 
Wellcome Trust, as well as vaccine 
and immunology specialists Adrian 
McDermott and Daniel Douek of 
the National Institutes of Health.) 

Disease surveillance in the 
United States now relies on a 
patchwork of hospitals, clinics, and 
doctors to report unusual events to state 
health departments, which pass the infor- 
mation on to the Centers for Disease Con- 
trol and Prevention. The need for faster, 
more comprehensive surveillance, Mina 
says, “was starkly clear with the inability 
to identify and model local circulation of 
COVID-19 in a timely fashion.” 

Mina wants to watch for outbreaks by 
looking for antibodies to infectious agents 
in regularly collected, anonymized blood 
samples from every possible source—blood 
banks, plasma collection centers, even the 
heel needle sticks of newborns, which are 
taken in most states from every baby in or- 
der to identify genetic diseases. The samples 
would be identified only by geographical 
area. Chip-based platforms that can identify 
hundreds of thousands of antibodies are al- 
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ready produced commercially by companies 
including VirScan and Luminex. Mina says 
these could easily be scaled up to look at 
huge numbers of samples, either individu- 
ally or in batches. 

“This is an extraordinary and exciting 
concept,” says infectious disease specialist 
William Schaffner of the Vanderbilt Univer- 
sity Medical Center. “It is an example of the 
kind of fresh new thinking we need in pub- 
lic health.” But, Schaffner adds, “The logisti- 
cal challenges for such an endeavor could 
be daunting.” 

Mina and his co-authors envision ini- 
tially testing about 10,000 samples per day 
and later, if they secure funding to build 


Antibodies in donated blood or other samples could reveal where 
previously identified pathogens are returning or new ones emerge. 


up the project, some 100,000 per day for 
the United States alone. Even the smaller 
number could detect—far faster than the 
current reporting system—an outbreak of 
Zika virus in rural Louisiana, for example, 
or an eruption of West Nile virus in Colo- 
rado. The GIO could also accelerate the 
monitoring of seasonal influenza, allowing 
hospitals to prepare for patient surges and 
for public health officials to efficiently dis- 
tribute vaccines. 

When a new infectious disease such as 
COVID-19 appears, the GIO could track 
its spread. The antibody-detecting chips 
wouldn’t necessarily have to be updated to 
spot a new pathogen, such as SARS-CoV-2, 
the virus that causes COVID-19. Research- 
ers might see a rise in antibodies that non- 
specifically target known pathogens—and 
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might flag their unknown relatives. For ex- 
ample, a burst of antibodies that cross-react 
with known coronaviruses would likely have 
been seen in Chinese cities, such as Wuhan, 
as SARS-CoV-2 began to spread there. 

The idea of regularly monitoring entire 
populations for antibodies arose in the lab 
of evolutionary biologist Bryan Grenfell at 
Princeton University, where Mina worked as 
a postdoctoral fellow. Now, Mina has joined 
Grenfell and Jessica Metcalf, also an evolu- 
tionary biologist at Princeton, in expanding 
the concept. Antibodies can signal not just 
people who are currently infected, but also 
those who had the disease and recovered. 
The GIO would also be able to distinguish dif- 
ferent strains of a bacterium or virus 
infecting people because each pro- 
duces a unique antibody signature. 

The GIO team is already building 
a pilot laboratory in Massachusetts, 
while it looks to secure financial 
support. “Given the importance 
we believe this could have, we are 
beginning to look for funding from 
some of the major philanthropic 
donors of public health work,” Mina 
says. “We are currently exploring 
and open to all options.” 

Meanwhile, the team’s pilot proj- 
ect, supported by the Open Phi- 
lanthropy foundation, is gathering 
many thousands of anonymous 
blood samples from the plasma- 
collecting company Octapharma. 
By screening them for antibodies 
to SARS-CoV-2, Mina and his col- 
leagues hope to learn how useful widespread 
antibody testing can be in tracing the spread 
of the new coronavirus and possibly predict- 
ing future “hot spots” or localized outbreaks. 

People often do not develop antibodies 
until well after infections; for SARS-CoV-2 
it takes 1 or 2 weeks. But Mina says the 
antibody testing still provides valuable in- 
formation. “A week into an outbreak isn’t 
huge,” he said. “For example, if we were do- 
ing this with [blood from] just a small frac- 
tion of New York, we would have detected 
that [SARS-CoV-2] was there in February 
and could have given [Governor Andrew] 
Cuomo plenty of ammunition to close down 
the city March 1 instead of March 19.” 


Robert Bazell, an adjunct professor at Yale University, 
is a journalist based in New Haven, Connecticut. 
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SCIENTIFIC COMMUNITY 


Online GRE test heightens 


equity concerns 


Pandemic spurs graduate programs to drop 
standardized test requirements 


By Jane C. Hu 


s COVID-19 swept across the United 
States, standardized testing centers 
closed and the GRE General Test—an 
exam that’s required for admission to 
many U.S. graduate programs—went 
online. The Educational Testing Ser- 
vice (ETS), which offers the GRE, “completely 
revamped its delivery model so [aspiring 
graduate students] can test from the safety 
of home,’ it declared in May. Since then, 
though, scores of academics have 
raised concerns about the equity 
of the online version of the test, 
arguing it disadvantages prospec- 
tive students from rural and low- 
income backgrounds. “If I were ... 
a student trying to take this exam, 
satisfying [the online testing] cri- 
teria would be extremely difficult 
for me,” says Emily Levesque, an 
assistant professor of astronomy 
at the University of Washington, Seattle. 
Levesque wrote about ETS’s testing re- 
quirements in a Twitter thread this month, 
detailing what she sees as “a shopping list 
of hurdles.” Test takers must have access to 
a computer with a webcam—“tablets and 
smartphones won't cut it,’ she wrote—as 
well as a private room in a home with a sta- 
ble internet connection. Libraries and other 
public spaces are out. “We already know 
from virtual teaching this spring that not 
all students/prospective grads have access 
to [computers] in their homes,” she wrote. 
On top of that, test takers must have a 
whiteboard if they want to take notes, sit 


? 


in a standard—not “overstuffed”—chair, and 
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“Itwas simply 
a question 
of access.” 


Emily Levesque, 
University of 
Washington, Seattle 


ensure that no one enters their room for 
the duration of the 4-hour test. In a state- 
ment, Alberto Acereda, executive director 
of higher education at ETS, wrote that the 
rules are “necessary to ensure the testing 
experience is similar to that in a test center, 
as well as to maintain the security and in- 
tegrity of the test.” 

Yet to some academic departments al- 
ready questioning the value of the GRE, the 
burdensome requirements of the at-home 
test are a tipping point. Levesque’s depart- 
ment decided to temporarily sus- 
pend requiring GRE scores. “It 
was simply a question of access,” 
she says. “If we require the exam 
this year, that puts an excessive 
burden on folks we want to en- 
courage to apply.” 

Other departments have de- 
cided to forgo the GRE for good. 
“We’ve been thinking about 
{eliminating it] for a long time,” 
says Chrissy Wiederwohl, assistant depart- 
ment head for engagement and graduate 
affairs for Texas A&M University, College 
Station’s oceanography department, which 
voted to stop requiring GRE scores earlier 
this month. “COVID is what helped front- 
burner it.” 

Levesque’s and Wiederwohl’s depart- 
ments join a growing list of U.S. graduate 
programs that have moved away from the 
GRE in recent years. In 2018 alone, 44% of 
the country’s top molecular biology pro- 
grams dropped the GRE as an application 
requirement, according to an investigation 
by Science (31 May 2019, p. 816). Dubbed 
“GRExit,” the movement has been fueled 
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At-home GRE test takers must have a computer, 
private room, and stable internet connection. 


by concerns that the GRE doesn’t pre- 
dict student success in graduate school, 
and that its use in admissions decisions 
disadvantages applicants from  under- 
represented groups. 

Delia Shelton, a Black psychologist at 
the University of Tennessee, Knoxville, 
says when she was applying to graduate 
schools, taking the GRE was a hardship; 
it was expensive and she had to drive 
2 hours to her testing location. Her scores 
prevented her from applying to some pro- 
grams that specified a cutoff for applicants. 
Yet she’s done well for herself in academia, 
winning a prestigious fellowship from the 
U.S. National Institutes of Health and an 
assistant professorship at her university. 
She and her colleagues voted earlier this 
month to eliminate the GRE from admis- 
sions requirements. The scores “don’t 
speak to how well you can do in graduate 
school,” she says. 

“The socioeconomic constraints [of] 
standardized testing ... [are] well docu- 
mented, and I think this at-home test exac- 
erbates some of those,’ agrees Joshua Hall, 
director of admissions for the biological 
and biomedical science program at the Uni- 
versity of North Carolina, Chapel Hill, who 
maintains a list of roughly 300 life science 
programs that have joined GRExit. Hall es- 
timates that 15 programs have contacted 
him since the start of the pandemic asking 
to be added to his list. 

Acereda emphasized that GRE scores 
can be valuable as part of a holistic review 
process, one that looks at reference letters, 
essays, and other application materials. 
He acknowledged that some prospective 
students might be unable to take the test 
online, but he added that more than 1000 
test centers have already reopened. “As the 
world continues to reopen after COVID, test 
takers will have greater choice regarding 
where they would like to test.” 

This year, however, some simply gave up. 
Natasha Hodges says she ran into problems 
when she couldn’t install the proctoring 
software on her Apple laptop. “No matter 
how many people I chatted with, or how 
many times I’ve called or emailed them, no 
one can explain to me or even address [my 
problem],” she says. But after failing to re- 
solve her technical issues, she was pleased 
to discover that many of the microbiology 
Ph.D. programs she wants to apply to have 
waived the GRE as an application require- 
ment. “It ends up working out that I don’t 
end up having to take it anyway,’ she says. & 


Jane C. Hu is a science journalist in Seattle. 
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SCIENCE AND SECURITY 


Senate bill to curb foreign threats raises alarms 


Sweeping changes to protect research could hamper academic collaboration, critics say 


By Jeffrey Mervis 


bipartisan group of U.S. senators 

last week proposed sweeping—and 

controversial—changes in how the fed- 

eral government manages academic 

research in the face of foreign threats. 

The authors of the legislation, more 

than 1 year in the making, claim it will stop 

China and other countries from stealing 

the fruits of federally funded research with- 

out weakening a system that has made the 

United States a global leader in innovation. 

But research advocates worry the bill, if en- 

acted, would restrict the exchange of talent 
and ideas. 

Drafted by Senators Rob Portman 
(R-OH) and Tom Carper (D-DE) 
and with eight Republican and five 
Democratic co-sponsors, the Safe- 
guarding American Innovation Act 
is the latest and most substantive 
attempt in Congress to reconcile 
scientific security and openness. 
One contentious provision would 
give the State Department grounds 
to reject a visa application from 
anyone with ties to a foreign gov- 
ernment seen as hostile to the 
United States. Critics worry such 
power could be used to keep out 
the tens of thousands of Chinese 
graduate students and postdocs 
who seek to study in the United 
States each year. 

Other provisions would expose 
scientists who fail to disclose ties to for- 
eign governments to criminal penalties 
including jail time, require international 
partners to embrace U.S. scientific norms, 
lower the size of foreign gifts that universi- 
ties must report, and give the White House 
budget office new powers to oversee re- 
search security. 

“For nearly 2 decades, the federal gov- 
ernment has been asleep at the wheel 
while foreign governments have exploited 
the lack of transparency in our education 
system and bought access and influence,” 
says Portman, who leads the Permanent 
Subcommittee on Investigations, which 
has issued several reports sharply critical 
of current federal efforts. “This bill will 
help us stop foreign governments from 
stealing our research and innovation.” 

The sponsors don’t hide their intended 
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target. “America’s research enterprise 
is the best in the world and the Chinese 
Communist Party knows it,” says Senator 
Josh Hawley (R-MO). “That’s why they’ve 
spent the last 20 years stealing American 
taxpayer-funded intellectual property.” 

Carper, the top Democrat on the investiga- 
tive panel and a co-sponsor, uses more judi- 
cious language. The legislation, he says, is a 
“common sense [approach] to protect Amer- 
ican intellectual property and better leverage 
our international research partnerships.” 

A Portman press release claims “wide- 
spread support” from academia. But all of 
the supportive statements come from in- 
stitutions in his home state of Ohio, and 


Senator Rob Portman (R-OH) targets China in new legislation. 


many also hint at the need to tweak some 
of its provisions. “We endorse [its] goals to 
modernize the safety and security of our 
nation and we look forward to continuing 
to collaborate with Sen. Portman as the 
legislation moves forward,” says Barbara 
Snyder, who is stepping down this fall as 
president of Case Western Reserve Univer- 
sity in Cleveland to lead the Association of 
American Universities (AAU), a coalition 
of 65 major research institutions. 

In private, research advocates express 
grave reservations. “It violates the culture 
of openness that is fundamental to aca- 
demic research,” one says. “I don’t think 
the higher education community is going 
to like any of this,” says another, who, like 
many advocates, requested anonymity be- 
cause they were not authorized to speak 
for their organization. 
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Although the bill’s language is subtle, 
it contains “key provisions ... [that] are 
overly broad and will only serve to harm 
American science without improving na- 
tional security,’ says AAU’s Tobin Smith. 
One such provision, Smith and others say, 
would give the State Department the au- 
thority to reject a visa application from 
anyone based on their “cooperation with 
... military organizations adversarial to the 
United States, foreign institutions involved 
in the theft of United States research, [or] 
a government that seeks to undermine the 
integrity and security of the United States 
research community.” 

But staffers on the subcommittee that 
Portman leads say the language 
would apply to fewer researchers 
than the critics fear. “The focus 
of the bill is on bad actors,’ one 
staffer noted. “The vast majority 
of foreign researchers [asking to 
come to the United States] are be- 
nign, and we need their talents.” 

The bill would also empower 
the State Department to reject or 
restrict the activities of a visa ap- 
plicant if officials decide that giv- 
ing the applicant access to “goods, 
technologies, or sensitive infor- 
mation” would harm the United 
States. Extensive rules already 
limit the sharing and export of re- 
search products deemed sensitive. 
But lobbyists say the new provision 
could require universities to im- 
pose additional restrictions on visa hold- 
ers, such as blocking them from attending 
open lectures or visiting laboratories doing 
unclassified research. 

Again, the staffers accuse the research 
community of overreacting. “We're not 
locking down campuses,” one staffer says. 
But universities and other federally funded 
institutions “don’t need to give everyone 
access to everything.” 

There is no companion bill in the House 
of Representatives, and the House’s Demo- 
cratic leadership is more skeptical that 
the problem of foreign influence warrants 
wholesale legislative changes. Little time 
remains for the Senate to act on the leg- 
islation before the November elections. 
But research advocates expect the debate 
in Congress to continue regardless of the 
outcome of the vote. 
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MARTIAN Chik 


NASAs Perseverance rover aims to find out whether 
ancient Mars was warm and wet or cold-and dry sy PautVoosen 


ASA’s newest Mars rover, Persever- 
ance, is going back in time to the 
bottom of a vanished lake. If all 
goes well, in February 2021 it will 
land in Jezero crater and pop the 
dust covers off its camera lenses. 
Towering in front of it, in all like- 


lihood, will be a 60-meter cliff of 
mudstone: the edge of a fossil- 
ized river delta. These lithified martian 
sediments could hold answers to urgent 
questions about the earliest days of Earth’s 
chilly, parched neighbor: How did this pint- 
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size planet, so distant from a faint young 
Sun, support liquid water on its surface? 
How much water was there, and how long 
did it persist? And did Mars ever spawn life? 

The 45-kilometer-wide crater is an in- 
triguing target. Billions of years ago, when 
life was just beginning on Earth, water 
broke through its western rim and spilled 
into its interior, carrying sediments that 
settled and piled up in thick, meandering 
braids that today can be seen from space, 
as plain as day. “It’s kind of like the Mis- 
sissippi delta, but smaller,’ says Raymond 
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Arvidson, a planetary geologist at Wash- 
ington University in St. Louis. The wa- 
ter filled the crater like a bathtub until, 
250 meters deep, it breached the eastern 
rim. And then, just as mysteriously as it ar- 
rived, the water disappeared. 

Scientists have traced the tracks of ancient 
water across Mars ever since the 1970s, when 
orbiters revealed branching valley networks 
that matched the dendritic shape of water- 
eroded valleys on Earth. In the 1990s, the 
Mars Global Surveyor zoomed in on deeply 
incised gullies that could only have been 
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Perseverance is thermally 
tested in artificial sunlight. 
It will explore more 

than 500 million years 

of martian climate. 


carved by powerful flows of water—and may 
even have glimpsed shorelines from an an- 
cient ocean. Later orbiters found evidence 
of abundant clay-bearing minerals that 
need water to form. More recently, the Cu- 
riosity rover, Perseverance’s predecessor, has 
charted the existence of a long-lived lake at 
the bottom of its adopted home, Gale crater. 

Some scientists believe the water shows 
ancient Mars was warm for millions of years, 
a favorable climate for life to emerge. Others 
say the climate was cold and dry, punctuated 
by sporadic bursts of water that only lasted 
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for hundreds or thousands of years—a much 
more difficult environment for life to take 
root. Along with the question of past life, says 
Ken Farley, the mission’s project scientist and 
a geologist at the California Institute of Tech- 
nology (Caltech), Mars’s ancient climate “is 
the biggest unanswered question.” 
Perseverance will tackle both questions, 
although the search for life will take longer. 
The rover, developed by NASA’s Jet Propul- 
sion Laboratory (JPL) and set for launch 
next month from Cape Canaveral Air Force 
Station in Florida, is also the start of an au- 
dacious campaign that will ferry to Earth 
about 30 samples of martian rock and grit. 
Perseverance will gather the samples, and 
NASA and the European Space Agency 
(ESA) are designing two follow-up missions 
to retrieve them, aiming for launches in 
2026 (Science, 22 November 2019, p. 932). 
The complex mechanisms needed to drill 
and store these cores limited the room on 
board for tools to chemically analyze sam- 
ples and look for organic molecules. Until 
the samples reach labs on Earth, the ques- 
tion of whether life once existed in Jezero 
will probably go unanswered. “We'll have to 
be patient,” says Tanja Bosak, a geobiologist 
at the Massachusetts Institute of Technology 
and member of the rover’s science team. 
The story of the martian climate, on the 
other hand, will be etched across Jezero’s sur- 
face, visible to an array of rover instruments. 
Scientists can only make a rough guess 
at the lake’s age, but they think it formed 
3.8 billion years ago, about the same time as 
the valley networks, over hundreds or thou- 
sands of years. Unlike Curiosity’s target, Gale 
crater, which offers a snapshot of a moment 
some 3.5 billion years ago when Mars was 
likely drying out, Jezero and its surround- 
ings will grant access to more than 500 mil- 
lion years of martian history, including some 
of the planet’s oldest terrain, says Bethany 
Ehlmann, a Caltech planetary scientist and 
member of the science team. “We have the 
potential for a really rich history of climate.” 


BY DESIGN, PERSEVERANCE borrows much 
from Curiosity: a six-wheeled chassis the size 
of a small SUV, an imaging turret, a radio- 
isotope power source. “From the outside it 
looks the same,” says Allen Chen, one of the 
rover’s lead engineers at JPL. “But it’s got it 
where it counts.” That includes advanced new 
imaging instruments, landing capabilities, 
and a complex drilling system—innovations 
that led its budget, originally pitched as a 
bargain at $1.5 billion, to balloon and end up 
matching Curiosity’s $2.7 billion price tag, 
which includes operations. 

To analyze samples in its onboard lab, 
Curiosity’s drill only needed to pulverize 
rock. Perseverance, in contrast, must drill 
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intact cores, each about the size of a thick 
piece of school chalk, and store them within 
titanium tubes. The system also has to keep 
the cores safe and clean, to prevent Earth- 
borne microbes and molecules from being 
mistaken for martian ones when the cores 
finally arrive back on Earth. In the end, 
engineers dreamed up a system involving 
two robotic arms, nine drill bits, 43 sample 
tubes, and a rotating carousel. “When you 
look at it, you won’t think of it being sim- 
ple,’ says Adam Steltzner, the rover’s chief 
engineer at JPL, “but it was the simplest we 
could imagine.” 

Building and testing that system nearly 
delayed a mission straining to meet tight 
deadlines. In October 2019, engineers dis- 
covered the tubes seized up inside the drill 
bit when tested in martian conditions. “For 
me it was a moment of despair,’ Farley says. 
“How were we ever going to fix this?” 

The problem, it turned out, was that the 
rover was too clean. The tubes had been 
baked at 350°C for 1 hour, which not only 
sterilized them, but also vaporized a hydro- 
carbon film. The team hadn’t realized that 
the film, a patina that forms on nearly any 
metal exposed to Earth’s atmosphere, was 
needed as a grease. After several stressful 
months, they developed a cleaning routine 
that limited the baking to 150°C and in- 
cluded a series of chemical washes. That 
left a small amount of the film on the 
outside of the tubes but no trace inside, 
where it might contaminate samples. “We 
leave nothing behind, like a good hiker,” 
Steltzner says. 

After the issue was resolved, another 
mote of organic material began to threaten 
the mission’s launch. By February, the 
coronavirus pandemic had postponed the 
launch of ESA’s Rosalind Franklin rover, 
which already had parachute problems, un- 
til 2022, the next Mars launch window. De- 
termined to hit its window, NASA shuttled 
a skeleton crew to and from Florida for the 
rover’s final inspections, while most JPL en- 
gineers did what they could from their Cali- 
fornia homes. “It’s fascinating how much of 
it you can do from your living room,” says 
Jennifer Trosper, the rover’s deputy project 
manager for surface operations. “We're used 
to remote operation. We just had to move it 
back a little earlier.” 

In May, with the rover already stacked on 
the spacecraft that would ferry it to Mars, a 
C-130 transport plane delivered the cleaned 
sample tubes, quarantined in nitrogen- 
filled cases, to Cape Canaveral. Engineers 
loaded the tubes just before a heat shield 
sealed the rover within its landing capsule. 
A last-minute arrival of the tubes was al- 
ways the plan to limit contamination risks. 
Also, Trosper adds, “We just finished them.” 
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On 20 July, a 3-week launch window 
opens up. Seven months after an Atlas V 
rocket puts it on a path to Mars, the rover 
will plunge through the barely there mar- 
tian atmosphere. Just as for Curiosity, a 
“sky crane” hovering on retrorockets will 
unspool Perseverance on a tether and 
lower it to the ground. But there’s an im- 
portant improvement: A camera on the 
rover’s belly will assess the landscape as it 
descends and compare it to a stored map of 
safe landing spots. The sky crane will fire 
its thrusters to divert to one of these zones, 
enabling the rover to land far closer to its 
target than Curiosity did in 2012, in 
a nearly circular, 8-kilometer-wide 
landing ellipse at the delta’s edge. 


FROM THAT MOMENT it will be a player 
in what Nature Geoscience dubbed 
a “war” over Mars’s ancient climate. 
What Curiosity saw at Gale crater 
convinced some geologists that an- 
cient Mars remained warm for mil- 
lions of years. Sediments probably 
built up more slowly on Mars than on 
Earth, so the thick sediments at Gale 
suggested “this lake almost certainly 
existed for tens of millions of years, 
maybe longer,’ says John Grotzinger, 
the Caltech geologist who led Curios- 
ity’s science for its first few years. 

If so, the lake would have endured 
climate variations driven by chaotic 
wobbles in the planet’s tilt, which 
varies from 10° to 60° Something 
must have kept the planet warm 
while the lake shifted between tropical and 
arctic latitudes. “Did we land in one weirdo 
place on Mars? Probably not,’ Grotzinger 
says. But what warmed the climate is a 
mystery, he admits. “Something is miss- 
ing, and we don’t know what that is yet.” 

To the opposing camp, that’s grounds 
for skepticism about a warm early Mars. 
In 1991, James Kasting, a planetary scien- 
tist at Pennsylvania State University, Uni- 
versity Park, reported that an atmosphere 
of carbon dioxide (CO,) and water vapor, 
both greenhouse gases, was not capable of 
keeping the ancient planet wet and warm 
for millions of years. The atmosphere would 
have been too thin, and the early Sun too 
weak. Mars “must have had a phenomenal 
greenhouse effect,’ Arvidson says, double 
what exists now on Earth. To this day, 
even with more sophisticated models, “The 
climatologists haven’t figured out how to do 
it,” he adds. 

That has led these scientists to argue that 
martian water flowed in bursts lasting just 
thousands of years—brief exclamations in 
an eternal deep freeze. That is a Mars that 
climate models can simulate, says Robin 
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Sample tubes, baked a 


Wordsworth, a planetary scientist at Har- 
vard University. Its ancient volcanoes could 
have belched a lot of hydrogen, a strong but 
short-lived greenhouse gas. Periodic bursts 
of water could have rusted iron-bearing 
minerals, releasing more hydrogen to the 
air. Or asteroid strikes, more common in 
that era, could have released hydrogen if 
they hit regions rich in ice or subsurface 
water. “For all of them you can make epi- 
sodic warming work,’ Wordsworth says. 
“But not warm and wet.” 

The rocks in Gale crater can also support 
this view, Ehlmann says. They lack certain 


minerals that should be present if they 
were exposed to water for 1 million years 
or more. Jim Bell, a planetary scientist at 
Arizona State University, Tempe, has con- 
cluded that ancient Mars was probably like 
Antarctica, icy and dry, with spurts of melt. 
“More Earth-like does not mean like most 
of the Earth.” 


PERSEVERANCE WILL NEED the head start 
pro-vided by a precise landing to try to 
settle the issue. During its 2-year primary 
mission, it will take advantage of upgraded 
wheels and autonomous navigation capa- 
bilities to briskly traverse more than 15 kilo- 
meters—a distance Curiosity took more 
than 4 years to cover. The rover will collect 
its first 20 samples for an eventual return 
mission from the geologically diverse ter- 
rain it will cross. 

The first samples are likely to be rocks 
thought to come from an eruption that cov- 
ered parts of the crater after the lake dried 
up. Volcanic rocks contain trace radio- 
active elements that decay at a certain rate, 
a clock that lab scientists on Earth can use 
to date the eruption, putting a lower limit 
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ind washed clean of microbes, were the last 
rover parts installed. NASA aims to return about 30 to Earth. 


on the age of the lake. Mission scientists also 
hope to find outcrops of older volcanic rocks 
that sit below the delta mudstones, marking 
an eruption that occurred before the water 
arrived. Those would provide an upper age 
limit, making it possible to roughly bracket 
the lake’s existence. “When were these 
habitable environments in absolute time, 
and how quickly did they come and go?” 
Ehlmann asks. 

As the rover rolls along the lake bottom, 
a ground-penetrating radar mounted on its 
belly will fire, recording echoes that reveal 
the textures of sediment up to 10 meters 
below the surface. “We'll be creat- 
ing a giant ribbon of data,’ says Da- 
vid Paige, a planetary scientist at the 
University of California, Los Angeles, 
and the instrument’s deputy prin- 
cipal investigator. The reflections 
could help determine whether the 
lake was open water or covered in ice. 
Fine mud would suggest open water; 
anomalously large stones would sug- 
gest ice, which could have carried 
them to the middle of the lake before 
dropping them. 

From there the rover will visit the 
fine-grained clay-bearing mudstones 
of the lower parts of the delta. Here 
the hunt for past life will take the 
lead. On Earth, such clays blanket 
living things and preserve them as 
fossils. In similar clays at Gale crater, 
Curiosity scientists detected traces 
of complex organic compounds that 
resembled kerogen, the feedstock of 
oil. But they could not determine whether 
the compounds, detected at levels of a few 
dozen parts per million, were produced by 
ancient life, or deposited on the martian 
surface by meteorites, which often contain 
complex organic molecules. 

Two instruments mounted at the end 
of Perseverance’s main robotic arm may 
help tell the difference. One will fire an 
ultraviolet laser at the rocks; the other will 
bombard them with x-rays. The radiation 
re-emitted by atoms in the rocks could re- 
veal organic chemistry. Mapping any or- 
ganics in a rock could also say something 
about their origin. A uniform signal would 
favor meteoritic fallout, whereas a lumpier 
distribution, and the presence of miner- 
als that hint at microbe-fueling reactions, 
could be a sign of life—and a green light to 
drill a sample. 

As the rover forges a path up the delta, 
the fine mudstone will give way to rough 
sandstone. The team will keep an eye out 
for exposures of opal-like rocks that have re- 
cently been spotted from space. Opal forms 
from a solution of silica and water, and on 
Earth the deposits are classic fossil-hunting 
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A trip through time 


- Next month, NASA's Perseverance rover will head to Mars to explore the remains of a muddy riverdelta 

\— more than 3 billion years old. Scientists don’t know whether the water came during a brief thaw on acold, dry planet 
C or ina lasting period of warmth. The rover’s path, crossing more than 500 million years of geologic history, 

* could help resolve the debate. Graphic by Chris Bickel 
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spots. That’s because the mineral creeps 
into organic layers and preserves fossil 
structures, Bosak says. “That’s where we 
find the most beautifully preserved micro- 
bial mats.” The rover’s cameras will search 
for such structures, but Bosak doubts they 
will be seen—even on Earth, they are not 
often apparent until polished in the lab. 

The sand grains, washed in by the long- 
lost river, could also say something about 
what caused Mars’s early warmth—whether 
steady or intermittent—to dissipate after 
the delta formed. Some of the sand grains, 
eroded from volcanic rocks, will contain 
radioactive isotopes that make it possible 
to date them. Scientists on Earth will also 
examine certain minerals to look for the 
frozen imprint of a magnetic field. Mars is 
believed to have had a magnetic field early 
in its history, generated by a molten dynamo 
in the planet’s interior. The field would have 
failed as the dynamo cooled and shut down, 
and some believe that explains Mars’s radi- 
cal change in climate. A weakening field 
could have allowed charged particles from 
the Sun to erode the planet’s once-thick 
atmosphere. Water would have escaped to 
space, making the planet colder and drier. 
Magnetic signatures teased out of the sand 
grains could show whether the decline of 
the field preceded—and perhaps caused— 
the climate change. 


AFTER NEARLY 2 YEARS of frantic drilling, the 
rover will climb one of the delta’s fingers 
to reach the shores of Jezero’s paleolake, 
fast against the crater’s edge. Orbiters have 
spotted a bathtub ring of carbonate rocks 
running around the crater rim in a narrow 
band, likely where the lake’s warm shallows 
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Microbes in Turkey's Lake Salda create layered mounds of carbonate called stromatolites. Jezero crater may have hosted a similar environment billions of years ago. 


were. On Earth, such deposits are known 
to preserve fossilized stromatolites, bumpy 
cauliflowerlike mounds formed by the 
growth of bacteria. “They are an ideal place 
to look for past life,” says Briony Horgan, 
a planetary scientist at Purdue Univer- 
sity. That is, she says, if the deposits were 
formed by the lake, and not by hot water 
created by the crater-forming impacts. 

If the lake was responsible for the car- 
bonate deposits, they will offer a window 
on the ancient martian atmosphere, which 
supplied the CO, that formed them. By 
comparing carbon isotopes from carbon- 
ates in the bathtub ring and in older rocks 
outside the crater, scientists could learn 
how levels of atmospheric CO,—and the 
greenhouse effect it drives—changed over 
this time. 

The outcrops around these shallows, 
near the entry point of the river, could 
also betray something about the climate 
at the time the delta was laid down, says 
Timothy Goudge, a planetary scientist at 
the University of Texas, Austin. Layering in 
the outcrops will reveal how much water 
was needed to form the delta, how long it 
flowed, and whether it came in the brief 
floods or steady flows. Cracks in river bot- 
tom rocks could be wedges opened up by 
continuous freeze-thaw cycles—a sign of 
persistent frigid conditions. 

The shallows will likely mark the end of 
the primary mission. But the rover ought to 
have many more years left on its odometer. 
Engineers want the rover, while still healthy, 
to drop some samples on flat, accessible 
terrain where a later mission can retrieve 
them. But it may drill some sites twice, and 
it will continue to collect—in case the rover 
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itself is healthy enough to deliver samples 
to the Earth return mission. 

After the primary mission, many on 
the team will be eager to escape the delta 
for the ancient, mysterious terrain to the 
west. Deposits of clay and carbonate seen 
there also needed water to form. If bands 
of water-weathered rock capped by water- 
deposited sediments are visible on its me- 
sas, a temperate climate may have prevailed. 
Alternatively, as Ehlmann and others be- 
lieve, this landscape could be what’s left of 
an icy subsurface that was heated by nearby 
giant impacts 4 billion years ago and turned 
into an underground hydrothermal system 
capable of fostering life. “That would point 
to an ancient Mars that was habitable, but 
not so warm,” she says. 

Whatever answers the rover finds, it 
will mark the end of an era on Mars. For 
decades, NASA has dominated exploration 
of the planet’s surface, culminating in the 
increasingly ambitious rovers of the past 
2 decades. “We’ve had the privilege and 
responsibility to do a systematic investiga- 
tion of a planet,” says Jim Watzin, director 
of NASA’s Mars Exploration Program. Other 
nations will soon add their rovers, starting 
with China this year (see sidebar, p. 1420). 

But Perseverance also kicks off a new era. 
The sample return effort it anchors “will 
be the first round trip to another planet 
by humanity,’ Watzin says. Bringing Mars 
to Earth will enable scientists to probe the 
secrets of the Red Planet more deeply. If 
humans follow in the rover’s tracks in the 
coming decades, as the United States and 
China have vowed, the terrain they encoun- 
ter may seem strange. But it will be famil- 
iar ground. 
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Mars mission would put China among space leaders 


By Dennis Normile 


ASA's Perseverance rover may have 

company on the Red Planet. China 

aims to leap to the front ranks in plan- 

etary exploration with an ambitious 

Mars mission, its first independent 
bid to reach the planet. Tianwen-l1—“quest 
for heavenly truth”—consists of not only 
an orbiter, but also a lander and a rover, a 
trifecta no other nation has accomplished 
on its first Mars bid. “A successful landing 
would put China among elite company,” 
says Mason Peck, an aerospace engineer 
at Cornell University. 


or rover missions to Mars, only 10 have 
been successful. Nine of those 10 were 
NASA missions. A Russian probe landed 
successfully, but almost immediately lost 
communications. 

Scientists involved in Tianwen-l said 
they did not have permission from the 
China National Space Administration 
(CNSA) to speak to the press, and the 
agency did not respond to questions. 
Although state media have run stories 
about the mission, there is nothing like 
the fanfare that accompanies a NASA 
Mars landing. Several sources within 
China's space community believe the 


The as-yet-unnamed Tianwen-1 rover is the size of a small golf cart and one-quarter the weight of NASA's 
Perseverance. The solar-powered rover is expected to operate for at least 90 days. 


Due to launch in July, the mission, if suc- 
cessful, would mark dramatic progress for 
China's space program. In recent years it 
has fielded several lunar landers but made 
only one attempt on Mars, an orbiter that 
piggybacked on a failed 2011 Russian mis- 
sion to the martian moon Phobos. 

A Mars landing is among the most 
challenging feats in spaceflight. Unlike the 
Moon, Mars has an atmosphere, which 
means landers need protection from the 
heat generated during descent. But its 
air is too thin for a parachute alone to 
slow a lander; retrorockets are needed 
as well. And the entire sequence must 
be executed autonomously. Of 18 lander 
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agency is muting publicity to temper 
expectations for a risky mission. 

China has not yet announced which 
of two candidate landing sites it prefers. 
Both are flat, smooth plains not far from 
where NASA's Viking 1 and Viking 2 landers 
touched down in 1976. The low-lying sites 
give the lander’s parachute more time 
to work. Although scientists might have 
preferred a more rugged site at higher 
elevations with more interesting geology, “| 
speculate [CNSA engineers] are looking to 
particularly demonstrate a safe landing,” 
says Jim Bell, a planetary scientist at Ari- 
zona State University, Tempe, and veteran 
of several Mars rover missions. 
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Landing is not the only objective, how- 
ever. “Our goal is to explore and gather as 
much scientific data as possible,” CNSA 
chief mission architect Zhang Rongqiao 
said during a July 2019 lecture on the 
mission. The orbiter aims to study the 
martian magnetic field and atmosphere. 
With a high-resolution camera, it will map 
the surface and characterize its geology. 

The as-yet-unnamed, 240-kilogram 
rover, the size of a small golf cart and one- 
quarter the weight of Perseverance, carries 
six scientific instruments. Among them is 
a ground-penetrating radar (GPR) that, 
along with one on Perseverance, will be the 
first such devices on Mars, able to map 
subsurface features that orbiting radars 
see only dimly. “You can really investigate 
layering, structures, and the presence of 
permafrost or ice,” says Elena Pettinelli, a 
geophysicist at Roma Tre University, who 
has helped analyze GPR data from China’s 
Chang’e 3 and 4 missions to the Moon. 

Tianwen-1 will take 7 months to reach 
Mars, and it will be several more months 
before the orbiter releases the lander, 
according to a 2017 paper outlining the 
mission in Science China Technological 
Sciences. After trundling off a ramp on the 
lander, the solar-powered rover is expected 
to operate for at least 90 martian days, us- 
ing the orbiter as a communications relay. 
The orbiter will keep going for about one 
martian year, or roughly 23 months. 

Dean Cheng, a China policy expert at 
the Heritage Foundation, a conservative 
U.S. think tank, says beyond demonstrat- 
ing technological prowess, China wants to 
contribute “to the global pool of knowl- 
edge.” It believes “great powers are also 
scientific powers,” he says. 

Tianwen-l is not the only upcom- 
ing demonstration of those ambitions. 
Later this year, China plans to launch its 
Chang’e 5 mission, which would return 
the first Moon rocks since the last Soviet 
Union Luna mission in 1976; it will likely 
attempt a far-side sample return mission 
after that. 

CNSA officials have suggested that if 
Tianwen-l and Chang’e 5 go well, China 
could attempt to return samples from 
Mars beginning around 2030. That 
timeline puts it on the heels of the NASA— 
European Space Agency sample return 
mission—but not by much. 


With reporting by Bian Huihui. 
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PERSPECTIVES 


VIEWPOINT: COVID-19 


tay ei 


A, Correctly fitted masks are 
an important tool to reduce 
, airborne transmission of 


SARS-CoV-2, particularly in 
enclosed spaces, such as on this 
Moscow Metro train in Russia. 


Reducing transmission of SARS-CoV-2 


Masks and testing are necessary to combat asymptomatic spread in aerosols and droplets 


By Kimberly A. Prather’, Chia C. Wang?°, 
Robert T. Schooley* 


espiratory infections occur through 
the transmission of virus-containing 
droplets (>5 to 10 um) and aerosols 
(<5 um) exhaled from infected indi- 
viduals during breathing, speaking, 
coughing, and sneezing. Traditional 
respiratory disease control measures are 
designed to reduce transmission by drop- 
lets produced in the sneezes and coughs of 
infected individuals. However, a large pro- 


1422 26 JUNE 2020 + VOL 368 ISSUE 6498 


portion of the spread of coronavirus disease 
2019 (COVID-19) appears to be occurring 
through airborne transmission of aerosols 
produced by asymptomatic individuals dur- 
ing breathing and speaking (J—3). Aerosols 
can accumulate, remain infectious in in- 
door air for hours, and be easily inhaled 
deep into the lungs. For society to resume, 
measures designed to reduce aerosol trans- 
mission must be implemented, including 
universal masking and regular, widespread 
testing to identify and isolate infected 
asymptomatic individuals. 
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Humans produce respiratory droplets 
ranging from 0.1 to 1000 um. A competi- 
tion between droplet size, inertia, gravity, 
and evaporation determines how far emit- 
ted droplets and aerosols will travel in air (4, 
5). Larger respiratory droplets will undergo 
gravitational settling faster than they evapo- 
rate, contaminating surfaces and leading to 
contact transmission. Smaller droplets and 
aerosols will evaporate faster than they can 
settle, are buoyant, and thus can be affected 
by air currents, which can transport them 
over longer distances. Thus, there are two 
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major respiratory virus transmission path- 
ways: contact (direct or indirect between 
people and with contaminated surfaces) and 
airborne inhalation. 

In addition to contributing to the extent of 
dispersal and mode of transmission, respira- 
tory droplet size has been shown to affect the 
severity of disease. For example, influenza vi- 
rus is more commonly contained in aerosols 
with sizes below 1 um (submicron), which 
lead to more severe infection (4). In the case 
of severe acute respiratory syndrome corona- 
virus 2 (SARS-CoV-2), it is possible 
that submicron virus-containing 
aerosols are being transferred deep 
into the alveolar region of the lungs, 
where immune responses seem to 
be temporarily bypassed. SARS- 
CoV-2 has been shown to replicate 
three times faster than SARS-CoV-1 
and thus can rapidly spread to the 
pharynx, from which it can be shed 
before the innate immune response 
becomes activated and produces 
symptoms (6). By the time symp- 
toms occur, the patient has trans- 
mitted the virus without knowing. 

Identifying infected individuals 
to curb SARS-CoV-2 transmission 
is more challenging compared to 
SARS and other respiratory viruses 
because infected individuals can 
be highly contagious for several 
days, peaking on or before symp- 
toms occur (2, 7). These “silent 
shedders” could be critical drivers 
of the enhanced spread of SARS- 
CoV-2. In Wuhan, China, it has been 
estimated that undiagnosed cases 
of COVID-19 infection, who were 
presumably asymptomatic, were 
responsible for up to 79% of viral 
infections (3). Therefore, regular, 
widespread testing is essential to identify and 
isolate infected asymptomatic individuals. 

Airborne transmission was determined to 
play a role during the SARS outbreak in 2003 
(1, 4). However, many countries have not yet 
acknowledged airborne transmission as a 
possible pathway for SARS-CoV-2 (7). Recent 
studies have shown that in addition to drop- 
lets, SARS-CoV-2 may also be transmitted 
through aerosols. A study in hospitals in 
Wuhan, China, found SARS-CoV-2 in aerosols 
further than 6 feet from patients, with higher 
concentrations detected in more crowded 
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Particle size (4m) 


Infected, asymptomatic 


areas (8). Estimates using an average sputum 
viral load for SARS-CoV-2 indicate that 1 min 
of loud speaking could generate >1000 vi- 
rion-containing aerosols (9). Assuming viral 
titers for infected super-emitters (with 100- 
fold higher viral load than average) yields 
an increase to more than 100,000 virions in 
emitted droplets per minute of speaking. 
The U.S. Centers for Disease Control and 
Prevention (CDC) recommendations for so- 
cial distancing of 6 feet and hand washing to 
reduce the spread of SARS-CoV-2 are based 


Masks reduce airborne transmission 
Infectious aerosol particles can be released during breathing and 
speaking by asymptomatic infected individuals. No masking maximizes 
exposure, whereas universal masking results in the least exposure. 


@e- 
—I . a 1 
100 10 1 O11 


Healthy 


Minimum 
exposure 


on studies of respiratory droplets carried 
out in the 1930s. These studies showed that 
large, ~100 um droplets produced in coughs 
and sneezes quickly underwent gravitational 
settling (7). However, when these studies 
were conducted, the technology did not exist 
for detecting submicron aerosols. As a com- 
parison, calculations predict that in still air, a 
100-1m droplet will settle to the ground from 
8 feet in 4.6 s, whereas a 1-um aerosol par- 
ticle will take 12.4 hours (4). Measurements 
now show that intense coughs and sneezes 
that propel larger droplets more than 20 feet 
can also create thousands of aerosols that can 
travel even further (7). Increasing evidence 
for SARS-CoV-2 suggests the 6 feet CDC rec- 
ommendation is likely not enough under 
many indoor conditions, where aerosols can 
remain airborne for hours, accumulate over 
time, and follow airflows over distances fur- 
ther than 6 feet (5, 0). 
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In outdoor environments, numerous fac- 
tors will determine the concentrations and 
distance traveled, and whether respiratory 
viruses remain infectious in aerosols. Breezes 
and winds often occur and can transport in- 
fectious droplets and aerosols long distances. 
Asymptomatic individuals who are speaking 
while exercising can release infectious aero- 
sols that can be picked up by airstreams (10). 
Viral concentrations will be more rapidly 
diluted outdoors, but few studies have been 
carried out on outdoor transmission of SARS- 
CoV-2. Additionally, SARS-CoV-2 
can be inactivated by ultraviolet ra- 
diation in sunlight, and it is likely 
sensitive to ambient temperature 
and relative humidity, as well as 
the presence of atmospheric aero- 
sols that occur in highly polluted 
areas. Viruses can attach to other 
particles such as dust and pollution, 
which can modify the aerodynamic 
characteristics and increase disper- 
sion. Moreover, people living in ar- 
eas with higher concentrations of 
air pollution have been shown to 
have higher severity of COVID-19 
(11). Because respiratory viruses can 
remain airborne for prolonged pe- 
riods before being inhaled by a po- 
tential host, studies are needed to 
characterize the factors leading to 
loss of infectivity over time in a va- 
riety of outdoor environments over 
a range of conditions 

Given how little is known about 
the production and airborne be- 
havior of infectious respiratory 
droplets, it is difficult to define a 
safe distance for social distancing. 
Assuming SARS-CoV-2 virions are 
contained in submicron aerosols, 
as is the case for influenza virus, a 
good comparison is exhaled cigarette smoke, 
which also contains submicron particles and 
will likely follow comparable flows and dilu- 
tion patterns. The distance from a smoker at 
which one smells cigarette smoke indicates 
the distance in those surroundings at which 
one could inhale infectious aerosols. In an 
enclosed room with asymptomatic individu- 
als, infectious aerosol concentrations can 
increase over time. Overall, the probability 
of becoming infected indoors will depend 
on the total amount of SARS-CoV-2 inhaled. 
Ultimately, the amount of ventilation, num- 
ber of people, how long one visits an indoor 
facility, and activities that affect airflow will 
all modulate viral transmission pathways 
and exposure (J0). For these reasons, it is im- 
portant to wear properly fitted masks indoors 
even when 6 feet apart. Airborne transmis- 
sion could account, in part, for the high sec- 
ondary transmission rates to medical staff, as 
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well as major outbreaks in nursing facilities. 
The minimum dose of SARS-CoV-2 that leads 
to infection is unknown, but airborne trans- 
mission through aerosols has been docu- 
mented for other respiratory viruses, includ- 
ing measles, SARS, and chickenpox (4). 

Airborne spread from undiagnosed infec- 
tions will continuously undermine the effec- 
tiveness of even the most vigorous testing, 
tracing, and social distancing programs. After 
evidence revealed that airborne transmis- 
sion by asymptomatic individuals might be a 
key driver in the global spread of COVID-19, 
the CDC recommended the use of cloth face 
coverings. Masks provide a critical barrier, 
reducing the number of infectious viruses 
in exhaled breath, especially of asymptom- 
atic people and those with mild symptoms 
(12) (see the figure). Surgical mask mate- 
rial reduces the likelihood and severity of 
COVID-19 by substantially reducing airborne 
viral concentrations (13). Masks can also 
protect uninfected individuals from SARS- 
CoV-2 aerosols and droplets (73, 14). Thus, it 
is particularly important to wear masks in 
locations with conditions that can accumu- 
late high concentrations of viruses, such as 
health care settings, airplanes, restaurants, 
and other crowded places with reduced ven- 
tilation. The aerosol filtering efficiency of dif- 
ferent materials, thicknesses, and layers used 
in properly fitted homemade masks was re- 
cently found to be similar to that of the medi- 
cal masks that were tested (14). Thus, the op- 
tion of universal masking is no longer held 
back by shortages. 

From epidemiological data, places that have 
been most effective in reducing the spread of 
COVID-19 have implemented universal mask- 
ing, including Taiwan, Japan, Hong Kong, 
Singapore, and South Korea. In the battle 
against COVID-19, Taiwan (population 24 
million, first COVID-19 case 21 January 2020) 
did not implement a lockdown during the 
pandemic, yet maintained a low incidence of 
441 cases and 7 deaths (as of 21 May 2020). By 
contrast, the state of New York (population 
~20 million, first COVID case 1 March 2020), 
had a higher number of cases (353,000) and 
deaths (24,000). By quickly activating its 
epidemic response plan that was established 
after the SARS outbreak, the Taiwanese gov- 
ernment enacted a set of proactive measures 
that successfully prevented the spread of 
SARS-CoV-2, including setting up a central 
epidemic command center in January, using 
technologies to detect and track infected pa- 
tients and their close contacts, and perhaps 
most importantly, requesting people to wear 
masks in public places. The government also 
ensured the availability of medical masks by 
banning mask manufacturers from exporting 
them, implementing a system to ensure that 
every citizen could acquire masks at reason- 
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able prices, and increasing the production of 
masks. In other countries, there have been 
widespread shortages of masks, resulting in 
most residents not having access to any form 
of medical mask (75). This striking difference 
in the availability and widespread adoption 
of wearing masks likely influenced the low 
number of COVID-19 cases. 

Aerosol transmission of viruses must be 
acknowledged as a key factor leading to the 
spread of infectious respiratory diseases. 
Evidence suggests that SARS-CoV-2 is silently 
spreading in aerosols exhaled by highly con- 
tagious infected individuals with no symp- 
toms. Owing to their smaller size, aerosols 
may lead to higher severity of COVID-19 
because virus-containing aerosols penetrate 
more deeply into the lungs (0). It is essen- 
tial that control measures be introduced to 
reduce aerosol transmission. A multidisci- 
plinary approach is needed to address a wide 
range of factors that lead to the production 
and airborne transmission of respiratory 
viruses, including the minimum virus ti- 
ter required to cause COVID-19; viral load 
emitted as a function of droplet size before, 
during, and after infection; viability of the 
virus indoors and outdoors; mechanisms of 
transmission; airborne concentrations; and 
spatial patterns. More studies of the filtering 
efficiency of different types of masks are also 
needed. COVID-19 has inspired research that 
is already leading to a better understanding 
of the importance of airborne transmission of 
respiratory disease. 
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GENETICS 


Evolution 
after genome 
duplication 


Genetic interactions in yeast 
reveal factors governing 
duplicate gene retention 
and divergence 


By lan M. Ehrenreich 


enome duplication generates an 
extra copy of nearly all genes car- 
ried by an organism, providing a 
potential substrate for evolution. 
Although many duplicate genes will 
be eliminated after a genome dupli- 
cation, those that are retained may evolve 
distinct functions over time. This pro- 
cess can be studied by characterizing the 
shared and divergent functions of dupli- 
cate genes in present-day organisms whose 
ancestors experienced genome duplication 
in the past. However, such work requires 
examining the functional relationships 
between each copy of a duplicated gene 
and the other genes in the genome. This 


“, duplicate gene retention and 
divergence heavily depended 
on whether the different 
functions of these genes were 
entangled (i.¢., unable 
to evolve independently).” 


is inherently difficult for duplicate genes 
because of their redundancy. However, on 
page 1445 of this issue, Kuzmin et al. (1) 
show that systematic analysis of di- and 
trigenic genetic interactions in budding 
yeast can overcome this challenge. With 
this approach, they discover general con- 
straints that influence the retention and 
divergence of duplicate genes. 

In 1970, it was proposed that gene dupli- 
cation provides the critical fuel for evolu- 
tion (2). Owing to their functional redun- 
dancy, it was hypothesized that duplicate 
genes are able to evolve in ways that sin- 
gle-copy genes cannot. Over time, different 
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copies of a duplicate gene might 
diverge to have distinct or even 
new functions. These ideas pre- 
ceded genome sequencing by de- 
cades and were difficult to study 
empirically when first proposed. 
However, duplicate genes were 
subsequently found to be preva- 
lent in the genomes of many or- 
ganisms, supporting their poten- 
tial importance to evolution (3, 4). 

Gene duplication arises through 
a variety of mechanisms and can 
occur on scales ranging from 
individual genes to entire ge- 
nomes (2). Evidence of genome 
duplication exists in a number 
of eukaryotic lineages (5). These 
genome duplications likely en- 
hanced organisms’ robustness to 
genetic and environmental per- 
turbations, although they might 
have also facilitated evolution- 
ary innovations (5). The budding 
yeast Saccharomyces cerevisiae— 
the same organism used to make 
beer, bread, and wine—is an ex- 
cellent model system for studying 
how duplicate genes evolve after 
genome duplication. The ances- 
tor of this organism experienced 
a genome duplication, and hun- 
dreds of duplicate genes produced 
by this event have been retained (6). These 
duplicate genes provide a foundation 
for research on duplicate gene retention 
and divergence. 

Numerous approaches have been used 
to study these duplicate genes in budding 
yeast, including comparative genomics (7), 
transcriptomics (8), and single gene dele- 
tions (9). This work suggests that retained 
duplicate genes are enriched for essential 
cellular components (7). They also show 
much greater transcriptional divergence 
than protein sequence divergence (8). 
Potentially consistent with this last point, 
individual deletion of many duplicate 
genes has little phenotypic effect, which 
implies that both copies maintain overlap- 
ping function (9). 

This past work was unable to directly 
probe the functional relationships between 
duplicate genes and the other genes in the 
genome. Such information is the most use- 
ful for understanding how duplicate genes 
have functionally diverged, but is also more 
difficult to obtain (70). Analysis of genetic in- 
teractions is one potential high-throughput 
strategy for mapping these functional rela- 
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Duplicate gene evolution 
After genome duplication, how duplicate genes evolve depends on functional 
entanglements. When entanglement is high, one copy is likely to be lost. 

By contrast, when entanglement is low, the chance that both copies will be 
retained and able to diverge is higher. Duplicate genes with intermediate 
levels of entanglement may be retained but are less able to diverge. 
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functions of these genes were en- 
tangled (i.e., unable to evolve in- 
dependently). Such functional en- 
tanglements influenced how the 
copies of a duplicate gene accu- 
mulated degenerative mutations 
over time. High entanglement 
tended to result in one duplicate 
gene copy amassing degenerative 
mutations and being lost (see the 
figure). When entanglement was 
lower, duplicate genes were more 
likely to be retained, with their 
functional divergence inversely 
related to their entanglement. 
This suggests that the two classes 
of duplicate genes that were found 
in budding yeast possess different 
levels of entanglement. 
Increasingly, evidence shows 
that evolution after genome du- 
plication is not entirely random 
(1, 10, 15). For many duplicate 
genes, the potential for retention 
and divergence is constrained 
by functional entanglements (J). 
This finding provides further in- 
sight about how genome duplica- 
tion contributes to evolution. For 
example, it bolsters the idea that 


High Intermediate 
entanglement entanglement entanglement 
Copy 1 Copy 1 Copy 1 
EE 8 EE 8 | | | 
mm Copy 2lost im Copy 2 =e Copy 2 = 
=e T IT TTT | 
| | un! Et | 


tionships (71). A genetic interaction occurs 
when a combination of mutations, such as 
deletions, exhibits unexpected phenotypic 
consequences (72). Although most studies 
focus on digenic interactions, analysis of 
genetic interactions among three or more 
genes can also provide important biological 
insights (13, 14). 

Kuzmin et al. used trigenic genetic in- 
teraction analysis to measure the shared 
and distinct functional relationships of 
duplicate genes. They examined 240 pairs 
of nonessential duplicate genes in budding 
yeast. To do this, they combined single and 
double gene deletions for each duplicate 
gene pair with a library of ~1200 single 
gene deletions spanning all major cellular 
processes. This produced 537,911 double- 
mutant and 256,861 triple-mutant yeast 
strains, resulting in the detection (through 
phenotypic changes) of 7197 digenic and 
4557 trigenic genetic interactions. On the 
basis of these genetic interactions, Kuzmin 
et al. distinguished two main classes of 
duplicate genes: those that are more func- 
tionally divergent, and those that have re- 
tained a high degree of functional overlap. 

In silico modeling informed by the ge- 
netic interaction data revealed that dupli- 
cate gene retention and divergence heav- 
ily depended on whether the different 
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many duplicate genes, as a result 
of their limited ability to evolve, 
may contribute more to robust- 
ness than to innovation (3-5). Yet it also 
indicates that particular duplicate genes 
have greater potential to facilitate inno- 
vation because they may be less affected 
by entanglements. These points illustrate 
how determining the shared and divergent 
functions of duplicate genes in organisms 
such as budding yeast can improve the un- 
derstanding of the evolutionary process. 
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SPECTROSCOPY 


Action spectra of chiral secondary structure 


Mass spectra of DNA complexes photoreduced with polarized light reveal their handedness 


By Perdita Barran 


ouis Pasteur has particular resonance 

in the midst of the coronavirus dis- 

ease 2019 (COVID-19) pandemic, 

given his work on the germ theory of 

disease that led to the development 

of vaccines for rabies and anthrax. 
Some years before these better- 
known discoveries, Pasteur had 
already made a great contribution 
to science with his observation of 
molecular chirality in crystals (J). 
Chirality is a fundamental property 
of molecular asymmetry, which in 
turn dictates the secondary struc- 
ture of proteins and gives rise to 
handedness in DNA helices. Some 
170 years later, on page 1465 of this 
issue, Daly et al. (2) present a new 
method, mass-resolved electronic 
circular dichroism (CD) ion spec- 
troscopy, that allows the chirality 
of secondary structures—in this 
case, isolated guanine-rich DNA 


quadruplexes—to be unambigu- _ Electrospray DNAto 
ously defined. mass spectrometer e Photoelectrons 
ee: CO. 
In the 1830s, the acceptance of > ® oer Sea” 
: »e ed) ee *, 
the concept of chemical structure _ «ee © e 


was some decades away, but devel- 
opments in optics were revealing 
regularity in the ordering of crys- 
talline materials at a submicrom- 
eter level. Light could be filtered 
through polarizing optics, such 
as Nicol prisms. Although not un- 
derstood at the time, this process 
causes the oscillating electric field 
of the light to move in only one 
direction. Using a polarized light 
source, Jean-Baptiste Biot (3) ob- 
served that a saturated solution of 
tartaric acid would rotate the po- 
larized light clockwise or anticlock- 
wise, but he presented no explana- 
tion for this phenomenon. 

Some 20 years later, in 1848, 
Pasteur crystallized paratartaric 
acid, produced from overboiled 
wine, and noticed that it produced 
two types of crystals, one that 
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Taking action 
Photoexcitation reduces 

the negative charge state of 
the parent ion. The product 
ion has the same mass and 
thus a higher mass-to-charge 
ratio, and reduced intensity as 
not all parent ions photoexcite. 


Circular 


dichroism spectra 
Acircular dichroism 
spectrum is recorded 

by monitoring the 

relative intensity of charge- 
reduced species (for 
right- versus left-handed 
excitation) at different 
laser wavelengths. 


looked like tartaric acid crystals and one 
that did not. He repeated Biot’s experiment 
with a 50-50 mixture of these crystals and 
found that this solution would not rotate 
the polarized light. From these observa- 
tions, Pasteur concluded that each crystal 
had an intrinsic property, and that the ma- 
terial in each crystal was the mirror image 


An exciting way to measure handedness 
Molecular handedness (chirality) is usually measured with circular 
dichroism, in which the direction of polarized light is rotated by 

y molecules in solution. Daly et al. determined 
such spectra for individual molecules in the gas phase. 


Gas-phase photoexcitation 

Charged DNA molecules are introduced to the mass spectrometer 
with an electrospray source and irradiated with circularly 
polarized laser light, which excites photoelectrons with an efficiency 
that is dependent on the rotation. 
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of the other, like a left and a right hand. 
Pasteur named this phenomenon chirality 
(from kheir, the Greek word for hand). It 
soon became evident that this is a property 
of fundamental importance to chemistry, 
where different substituents on tetravalent 
carbon create a chiral center. 

Naturally occurring biopolymers such as 
proteins and DNA are made of mo- 
nomeric units with such chiral cen- 
ters. When these repeat at regular 
intervals, they favor repeating non- 
covalent interactions that give rise 
to distinctive secondary structural 
motifs with characteristic optical 
signatures. Circular dichroism, the 
modern version of the experiments 
of Biot and Pasteur, measures the 
absorbance of polarized light and 
can verify, for example, that sec- 
ondary structures of proteins are 
correctly folded. However, CD mea- 
sures the aggregate effect of a mix- 
ture and can be insensitive to the 
presence of an enantiomer at lower 
concentrations, or to the particular 
form of the molecule that is caus- 
ing the optical rotation. 

To address this limitation, Daly 
et al. built an experimental appara- 
tus that can make precise measure- 
ments of chirality in biomolecules 
(see the figure). Unlike an optical 
absorption experiment in solution, 
the optical source (laser excitation) 
releases characteristic photoelec- 
trons that lead to charge-reduced 
ions, so this method is an action 
spectroscopy. Electrospray ioniza- 
tion introduces guanine-rich DNA 
quadruplex molecules of different 
chirality into the vacuum chamber 
of a mass spectrometer. This trans- 
fer is performed carefully under 
conditions that aim to leave the 
solvated structure unperturbed. A 
few thousand molecular ions are 
trapped and irradiated with polar- 
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chirality. Further, unlike solution 
experiments where the concentra- 
tion of molecules is high enough 
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that the absorbance of light can be readily 
measured, the gas-phase DNA ions are di- 
lute. Instead, Daly et al. use the mass spec- 
trometer to measure the action of the po- 
larized light on the DNA molecule. Facile 
release of a photoelectron upon irradiation 
gives a signal caused by the formation of 
the charge-reduced anion. The laser light 
pulses are polarized, so the efficiency of 
photoelectron removal depends on the he- 
lix handedness. By monitoring the inten- 
sity of the distinctive charge-reduced spe- 
cies as a function of the polarization, they 
are able to determine the chirality of the 
molecule. This type of experiment can iso- 
late any given molecular complex because 
each analyte is unequivocally defined by 
its mass-to-charge ratio. 

Prior work by this group revealed coex- 
isting structures formed by oligonucleo- 
tides in the presence of cations or or- 
ganic molecules (4); this work was aimed 
at understanding the balance of forces 
that guide folding and_ self-assembly. 
Stoichiometry and quantitative measure- 
ments of nucleic acid complexes were 
readily made by combining mass and ion- 
mobility spectrometries, but this approach 
could not determine chirality or details 
of secondary structure. Mass-resolved 
electronic CD ion spectroscopy comple- 
ments other structural mass spectrometry 
methods because it can provide secondary 
structure information in addition to mo- 
lecular identity. 

Daly et al. also studied complexes of DNA 
with ammonium and potassium counter- 
ions and obtained a tantalizing glimpse 
of the effects of individual molecule sol- 
vation. The wavelength dependence of 
the action of polarized light followed the 
same trend as data obtained in solution, 
albeit with differences in magnitude, in- 
dicating that structures are preserved and 
that these gas-phase results are relevant to 
molecules in solution. They extended their 
measurement on human telomeric DNA se- 
quences to determine enantiomeric ratios 
of mixtures of G-quadruplex topologies. 
Intriguingly, the difference in electron- 
detachment efficiency for the left-handed 
molecule and the right-handed molecule 
under a given circularly polarized light 
is equivalent to the slight enantiomeric 
excess in the products—up to 1% at some 
wavelengths—thereby demonstrating the 
high sensitivity as well as the potential of 
this new method. 
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PHOTOSYNTHESIS 


The simplicity of robust 


light harvesting 


A universal design principle underlies 
photosynthetic antenna systems 


By Christopher D. P. Duffy 


hotosynthetic light harvesting can 
achieve a quantum efficiency that 
approaches 100% (that is, the conver- 

sion of 100 photons of light into 100 
chemically available electrons), and 

yet it displays notable robustness in 

the face of ever-changing external light con- 
ditions. Although light harvesting varies in 
structure and composition across the range 
of photosynthetic life, there is an ongoing 
effort to uncover a set of common “design” 
principles for these systems. On page 1490 
of this issue, Arp e¢ al. (1) have revealed the 
first hints of a simple, seemingly universal 
set of rules that define the robustness of 
natural light harvesters. These rules should 
inform the design of future solar technology. 
The main challenge facing early photosyn- 
thetic life was the paucity of light that was 
characteristic of marine environments. This 
led to the evolution of the light-harvesting 
“antenna’—a large modular assembly of vari- 
ous pigment-binding proteins that absorbs 
and delivers energy to the photosynthetic 
reaction centers (2). Despite major structural 
and compositional differences, it has become 
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apparent that antenna systems share several 
common features (3, 4). Each antenna har- 
bors more than one type of pigment. For ex- 
ample, light-harvesting complex II (LHCII), 
the major antenna protein of plants, binds 
chlorophyll a and b and several carotenoids. 
This allows broader coverage of the solar 
spectrum, which is further enhanced by the 
fine-tuning of each pigment by its local pro- 
tein environment. A mixture of pigments are 
bound at a high density but with specific rela- 
tive distances and orientations, resulting in 
energy transfer that is fast and directional. 
The distribution of different pigments and 
their specific interactions with the protein 
ensures irreversibility, with energy flowing 
downhill to the photosynthetic reaction cen- 
ter rather than lingering in the antenna. It 
has been argued that these systems owe their 
efficiency to nontrivial quantum effects (5), 
although this has been recently challenged 
(6). Regardless, photosynthetic light harvest- 
ing is considered to be a biological mecha- 
nism that is finely tuned for efficiency. 
Efficiency, however, can be detrimental un- 
der strong light conditions. A mismatch be- 
tween an antenna’s ability to deliver energy 
and the finite maximum working rate of re- 
action centers can result in oxidative damage 
called photoinhibition (7). Although effective 
repair mechanisms exist, they are slow and 
metabolically costly. Therefore, a suite of reg- 


Photosynthetic antenna that handles the noise 
An antenna binding at least two different types of pigments, whose absolute and relative absorption 
wavelengths are finely tuned, can convert a fluctuating (i-e., noisy) input into a consistent (i.e., quiet) output. 
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ulatory and protective processes is in place to 
minimize damage. The fastest regulators ac- 
tivate within minutes and relax on a similar 
time scale when light returns to normal (8), 
but these arguably still lag behind the fastest 
fluctuations in light intensity (9). 

As well, light harvesting is not finely 
tuned in a molecular sense. Instead, the 
picture is one of robustness in the face 
of external conditions (JO) and even ge- 
netic manipulation. An extreme example 
is found in the model plant Arabidopsis 
thaliana, from which LHCII is eliminated 
through genetic manipulation. In re- 
sponse, an ersatz antenna is formed from 
assemblies of the remaining minor light- 
harvesting proteins (17). Although these 
mutant plants may not thrive like their 
wild-type counterparts, they are certainly 


",. the evolutionary driving 
force behind the development 
of photosynthetic antennae 
is not maximization of efficiency 
but the cancellation of noise.” 


photosynthetically competent. If there is 
some universal organizational principle 
behind light harvesting, it does not appear 
to lie in the molecular detail. 

Arp et al. have taken a new approach to in- 
vestigating the question of universal organi- 
zation. By applying network theory, they have 
determined the most basic organizational 
requirements necessary for a light-harvesting 
system to function optimally. The authors 
used a very generalized model of the antenna 
as a network of interconnected sites that can 
represent individual pigment states within 
delocalized multipigment states across the 
antenna. Energy enters this network from 
one or more input channels that reflect light 
absorption through different types of pig- 
ments. Two input channels were chosen as a 
minimal model (see the figure), which is rea- 
sonably representative of several real antenna 
systems, such as chlorophyll a and b in LHCII 
or bacteriochlorophyll c and e in the antenna 
of green sulfur bacteria (72). Absorbed en- 
ergy can take many different paths through 
the antenna. However, all paths terminate 
at a single output channel representing pho- 
toinduced charge separation. Such a system 
is subject to a high degree of input noise. 
External noise comes from rapid fluctuations 
in the incident radiation, whereas internal 
noise originates from the structural dynam- 
ics of the antenna. This results in a noisy 
output that fluctuates between underpow- 
ered, optimal, and overpowered states. The 
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underpowered state is metabolically insuf- 
ficient, whereas the overpowered state risks 
photoinhibition. The question then is whether 
the inputs are arranged in such a way that the 
output spends most of its time at the optimal 
state, thereby minimizing output noise. 

The input channels represent different 
groups of light-absorbing pigments. As 
such, they are each defined by the wave- 
length and rate at which they absorb. If 
their intrinsic rates (determined by chemi- 
cal properties and stoichiometry) are as- 
sumed to be identical, then the overall rate 
of absorption is determined solely by the 
intensity of available light at that wave- 
length. Consider, for example, two limiting 
regimes. If the two channels are identical 
(i.e., a single channel), then antenna homo- 
geneity minimizes internal noise. However, 
having a single absorbing species makes 
the system very sensitive to external noise. 
Conversely, if the two channels differ 
strongly in both wavelength and absorp- 
tion rate, then internal noise dominates. 
Optimum output is generated in an inter- 
mediate regime where the two inputs have 
similar wavelengths but different absorp- 
tion rates. This is achieved by locating the 
pair in a region of the spectrum of avail- 
able light that has the steepest gradient. 
With only this consideration in mind, Arp 
et al. predicted, with a high degree of accu- 
racy, the absorption profile of green plants, 
purple bacteria, and green sulfur bacteria. 

The finding of Arp et al. is important be- 
cause it suggests that the evolutionary driv- 
ing force behind the development of pho- 
tosynthetic antennae is not maximization 
of efficiency but the cancellation of noise. 
Moreover, the finding indicates that to build 
such a system, one must start from the sim- 
ple requirements of two similar absorbing 
species that are tuned to the steepest region 
(not the strongest) of the spectrum of avail- 
able light. Fine structural details are impor- 
tant, but they come as refinements to this 
simple underlying principle. 
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NEUROBIOLOGY 


Guide cells help 
navigate axon 
regeneration 


Rebuilding the flatworm 
visual system after injury 
requires guidepost-like 
muscle cells 


By Rachel Roberts-Galbraith 


uring embryonic development, 

neurons project nascent axons that 

navigate through space to find their 

targets. Axonal-growth paths forged 

by these pioneer axons depend on 

chemical and physical cues from 
cells that can include guidepost cells, 
which directly interact with nascent axons 
and induce them to grow, stop, or turn 
(1). A series of guidepost cells can serve as 
“stepping stones” to organize the complex 
growth of an axon in space (2, 3). Although 
guidepost cells direct axon formation dur- 
ing embryonic development in diverse 
model organisms, their existence often is 
transient. Thus, it was unclear whether 
guidepost-like cells promote axon regen- 
eration and whether regenerative guide- 
post cells are constitutive or induced by 
injury. On page 1447 of this issue, Scimone 
et al. (4) pinpoint regenerative guidepost- 
like cells in the visual system of freshwater 
flatworms called planarians. 

Planarians are well known for their 
ability to regenerate diverse cell types in 
predictable patterns even in the face of 
severe injuries or amputations (5). One of 
the most recognizable features of the pla- 
narian body is a pair of crossed eyespots, 
which consist of pigment cells that form 
optic cups and photoreceptor neurons 
that nestle their photosensitive elements 
within the pigmented cups (6). From the 
eyespots, planarian photoreceptor neurons 
project axons through space to connect to 
the cephalic ganglia (a bilobed, horseshoe- 
shaped brain). The eyespots are positioned 
dorsally and the brain ventrally in the 
planarian head. Thus, photoreceptor ax- 
ons must project ventrally and also turn 
to make either contralateral or ipsilateral 
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contacts (7). This highly predictable struc- 
ture made planarian photoreceptor neu- 
rons ideal for the identification of cellular 
guideposts that direct axon organization 
during regeneration. 

Scimone et al. thus began their search 
for regenerative guidepost-like cells that 
shape the planarian visual system (see the 
figure). By investigating notum gene—ex- 
pressing cells (notwm*) that are associated 
with photoreceptor neurons, the authors 
discovered three distinct cell types specifi- 
cally positioned at axonal decision points 
(4). The authors first identified clusters 
of muscle cells near the eyespots, called 
NMEs (notwm* muscle cells near the eye), 
and proposed that these cells promote the 
bundling of axons as they exit the posterior 


of the eyespot. A second group of muscle 


lifelong capacity for neuronal replacement 
and axon guidance. The new research also 
addresses long-standing questions about 
the ubiquity of guidepost cells across the 
animal kingdom. 

One surprise in the new study is 
the identity of the guidepost-like cells. 
Whereas guideposts are often neurons, 
glia, or epithelial cells (7), Scimone et al. 
unexpectedly determined that two of the 
three guidepost-like populations in the 
planarian visual system express markers of 
muscle identity. Although previous studies 
revealed a global role for muscles in body 
axis patterning (8), the current work illus- 
trates that planarian muscle cells also act 
locally to shape individual organ systems 
and even to direct the physical arrange- 
ment of single cells. 


Guidepost-like cells govern planarian regeneration 
Three specialized cell types direct the growth and positioning of nascent axons during regeneration 


of the visual system. 
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cells called NMCs (notum* muscle cells at 
the choice point) were found near decision 
points for axons as they diverged to project 
toward ipsilateral or contralateral contact 
sites. The third cluster of guidepost-like 
cells were neurons called NBCs (notwm* 
brain cells), which localized medially and 
worked either alone or with other medial 
neurons to promote midline crossing of 
photoreceptor axons at the optic chiasm. 
A combination of physical and molecular 
manipulations revealed that NMEs, NMCs, 
and NBCs serve as guidepost-like cells 
during regeneration to assist photorecep- 
tor axons toward their final arrangement. 
This demonstrates that highly regenerative 
organisms like planarians can use consti- 
tutive guidepost-like cells to promote a 
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guidepost cell positions 
After injury or amputation, two types of 
muscle cells (NME, NMC) and one neuronal 
cell type (NBC) guide nascent axons along 
their regeneration path. 


© notum* brain cell (NBC) 
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The third population of planarian 
guidepost-like cells, NBCs, were neurons. 
Although less surprising, this finding fits 
into an emerging theme of planarian neu- 
rons influencing brain regeneration in a 
multitude of ways, from patterning to fate 
choice. notwm* neurons at the anterior-most 
end of the planarian brain influence brain 
scaling (9), and a population of medial neu- 
rons produces Hedgehog ligand to promote 
neurogenesis (J0). Scimone et al. also built 
on the prior identification of arrowhead as 
a key transcription factor-encoding gene 
that is expressed in medial neurons (includ- 
ing NBCs) and influences medial patterning 
of photoreceptors and the anterior commis- 
sure, the largest connection between halves 
of the planarian brain (17). 
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It remains unknown whether guidepost- 
like cells organize the positioning of other 
planarian neuronal structures during re- 
generation (for example, axons that run 
along the ventral nerve cords, axons that 
project into the planarian pharynx, or neu- 
ral processes that bundle together to form 
sensory structures) (6). Planarian guide- 
post-like cells might also regulate other 
aspects of neuronal cell behavior during 
regeneration, including cell migration or 
synapse formation. 

After amputation, planarian guidepost- 
like cells must themselves be regenerated 
and positioned properly. Scimone e¢ al. de- 
termined that these cells regenerate inde- 
pendently of photoreceptor neurons; they 
discovered factors (intrinsic and extrinsic) 
that regulate regeneration of guidepost- 
like cells and ensure their proper arrange- 
ment in space. However, the precise mech- 
anisms that direct regeneration of each cell 
type remain unknown. 

The new work raises the intriguing 
possibility that human neural regenera- 
tion might be improved by mimicking 
the guideposts that direct vertebrate axon 
formation during development. Human 
neurons in the brain and spinal cord usu- 
ally fail to regenerate axons after injury. 
However, introducing cellular or molecu- 
lar “stepping stones” along a desired axo- 
nal path might coax axons toward better 
regrowth. Current and future research 
investigating this theme—in fields rang- 
ing from bioengineering to basic develop- 
mental biology—might reveal the extent 
to which cues or matrices meant to mimic 
guidepost-like cells can improve axon 
guidance for various populations of adult 
human neurons (72). | 
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Asoil aggregate interacts with 
microplastic (polyester) fibers. 


Microplastic in terrestrial ecosystems 


Research shifts from ecotoxicology to ecosystem effects and Earth system feedbacks 


By Matthias C. Rillig and Anika Lehmann 


oncern about microplastics (plastic 
particles <5 mm) polluting differ- 
ent environmental compartments is 
mounting. Research has recently be- 
gun to embrace terrestrial systems, 
having initially focused at least a 
decade earlier on marine and aquatic eco- 
systems (J—3). The early research agenda on 
microplastics in both aquatic and terrestrial 
systems was mainly ecotoxicological. It in- 
cluded laboratory tests on individual orga- 
nisms, often well-established test species 
(4), and also targeted selected soil proper- 
ties and processes. Such research is nec- 
essary to establish baseline mechanisms, 
which is important because microplastics 
differ from other pollutants. Many of their 
effects appear to be mediated by physical 
parameters, such as particle shape and size, 
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rather than overt chemically mediated tox- 
icity. Moreover, their effects are mostly sub- 
lethal or even nominally positive. Although 
the study of other global change factors has 
tended to focus at the level of the ecosys- 
tem, research on microplastic is only now 
on the verge of this wider view. 

The first step in this direction has been 
the conceptualization of the “plastic cycle” 
(5, 6), acknowledging that microplastics 
can move between different large-scale 
compartments, including the air, terrestrial 
habitats, rivers and other freshwater bodies, 
and the ocean, including its sediments (7) 
(see the figure). The cycle framework places 
these movements in the context of the pools 
and fluxes that are inherent to ecosystem 
ecology. The current challenge is to under- 
stand how microplastic flows affect such 
pools and fluxes in terrestrial ecosystems. 

Microplastics are mostly composed of 
carbon, among other elements. Microplastic 
addition to ecosystems thus represents a 
source of carbon independent of photosyn- 
thesis and net primary production. This 
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polymer carbon likely has a slow turnover, 
because the material is mostly inert; how- 
ever, the behavior and residence time of mi- 
croplastics in soil are currently unknown. 
We also do not know the input rate of mi- 
croplastic-carbon into ecosystems itself, be- 
cause research hitherto has largely focused 
on quantifying particle numbers and types, 
rather than on the microplastic-derived car- 
bon itself. Originally, most of this carbon is 
of fossil origin, rather than having recently 
been fixed from the atmosphere. Because of 
the resistance of microplastic to decompo- 
sition, it would be expected to accumulate 
in soils, where it needs to be accounted for 
in assessments of soil carbon storage (8), a 
major ecosystem function. 

Other effects of microplastics will be in- 
direct and likely depend on particle shape 
and size. For microplastic fibers, effects on 
soil aggregation, a key process governing 
soil structure, are quite well established 
(9). Soil aggregates are the crumbs contrib- 
uting to soil structure and have a central 
role in shaping the habitat of soil orga- 
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Microplastic fluxes and associated ecosystem feedbacks 

Deposition and accumulation of microplastics can affect soil properties, with consequences for process rates 
and net primary production (NPP), causing feedbacks to the atmosphere, including greenhouse gases (GHGs). 
So far, nanoplastic has unknown consequences for this system. 
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nisms. Additionally, carbon compounds are 
stored within aggregates, where they are 
physically protected from being rapidly de- 
composed. Soil aggregates also determine 
the pore space in the soil overall, in turn 
influencing movement of gases and water, 
and the activity of associated microbial 
communities. A completely different indi- 
rect effect occurs because of lower soil bulk 
density in the presence of fibers. This can 
lead to enhanced plant growth, probably 
because roots experience less resistance 
when growing (J0). However, negative ef- 
fects on plants, likely related to plastic ad- 
ditives, are also possible (17). Which types 
of microplastics could promote or inhibit 
plant biomass production is an important 
area for future research. 

Consequences for other element cycles 
are more uncertain. Direct effects will likely 
be minimal, because microplastics contain 
mostly negligible amounts of nitrogen and 
phosphorus (even though there are excep- 
tions, such as polyamide). However, altera- 
tions to soil structure would be expected 
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to change the rates of many microbial pro- 
cesses, including those in the nitrogen cy- 
cle. An example is denitrification, a process 
that occurs anaerobically, within the center 
of soil aggregates, which reduces nitrate 
and nitrite to gaseous forms of nitrogen, in- 
cluding nitrous oxide and nitrogen. Effects 
on emissions of nitrous oxide and other im- 
portant greenhouse gases are only now be- 
ing examined (72). One study (10) found an 
increase in arbuscular mycorrhizal fungi, 
a key symbiont group that associates with 
plant roots. If generally true, this could af- 
fect phosphorus cycling, because these sym- 
bionts transport nutrients, including phos- 
phorus, to their plant hosts. 

Plastic films, and likely fibers, may alter 
the flow of water in soils, including evap- 
oration (73). Thus, effects on ecosystem 
water dynamics and energy balance, medi- 
ated by direct effects in soils or indirectly 
through plants, are also likely. Other possi- 
ble ecosystem-level effects include altered 
rates of erosion owing to changes in soil 
aggregate stability. 
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There are some critical unknowns that 
need to be addressed before the impacts of 
microplastic pollution on terrestrial eco- 
systems and the subsequent feedbacks can 
be understood. Accurate, sensitive, low- 
cost, and harmonized detection methods 
and high-throughput sample processing 
are needed (/4), to better understand the 
effects on turnover and transformation 
processes in the soil. Beyond this, research 
needs to cover more ecosystem types. Most 
research has so far focused on agricultural 
systems, which are expected to contain the 
largest amount of microplastics (15) be- 
cause of input pathways (including sewage 
sludge, compost, and plastic mulching). 
We know much less about microplastics 
in other ecosystems, such as drylands or 
forests, where the microplastic dynamics 
might be quite different because of the dif- 
ferent ecosystem structures. 

Feedbacks to the Earth system can be ex- 
pected. Microplastic itself represents fossil 
carbon, which might indirectly affect rates 
of net primary production and carbon stor- 
age in soils and alter the fluxes of green- 
house gases. The direction, magnitude, and 
balance of these effects should be a focus 
of future research. Empirical work will need 
to adopt tools such as mesocosm studies 
(outdoor experimental systems that examine 
the natural environment under controlled 
conditions) and carefully designed field 
experiments to address these problems. 
Microplastic pollution is an international 
problem, and international cooperation in 
research will be key. & 
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ASTRONOMY 


Multiple, quiet, and close by 


A planetary system around a nearby star offers the 
prospect for atmospheric study 


By Melvyn B. Davies 


lanetary systems around stars are 
common, with many or most stars 
possessing them. On page 1477 of 
this issue, Jeffers et al. (1) report the 
discovery of a multiplanet system 
around the low-mass star GJ 887. 
GJ 887 is an unusual host star, as it is rela- 
tively inactive, and the planetary system 
happens to be very close to our Sun. 
This makes the system a good candi- 
date for study with the to-be-launched 
James Webb Space Telescope (JWST). 

Stars form with a range of masses, 
the vast majority being less massive 
than the Sun. GJ 887 is a so-called red 
dwarf and has a mass of about one- 
half that of the Sun. Many red dwarfs 
are found to host planetary systems, 
typically containing multiple, low- 
mass planets (2). 

Two methods have been used to 
discover most planets. In the tran- 
sit method, planets are found when 
they periodically block out a frac- 
tion of the starlight as they pass in 
front of their host star. This allows 
many stars to be searched for orbit- 
ing planets simultaneously. For ex- 
ample, some one hundred thousand 
stars were observed at the same time 0 
in the Kepler mission (3). However, 
for planets to be found in this way, 
their planetary orbits must be close 
to edge-on as seen from Earth in order for 
them to be seen to pass in front of their host 
star. This means that only a small fraction 
of planets will be found using this method. 

The planets around GJ 887 were discov- 
ered through a second approach: using the 
Doppler effect to measure the radial veloc- 
ity of the host star over time. A planet and 
star orbit around a common center of mass, 
which means that the presence of a planet 
orbiting a star can be deduced by observ- 
ing the periodic variation in the radial speed 
of the star. The radial velocity is measured 
from the Doppler shift of lines observed in 
the stellar spectra. Jeffers et al. observed 
GJ 887 nightly over a period of 3 months and 
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combined their data with other observations. 
They found that GJ 887 has at least two plan- 
ets, with orbital periods of 9.3 and 21.8 days, 
and a possible third planet farther out, with 
an orbital period of ~50 days. 

GJ 887 is not unusual in possessing plan- 
ets so close to it. Planets are generally be- 
lieved to form farther away from their host 
star and migrate inward owing to a drag 
force from the protoplanetary disk from 


Anearby planetary system 
The red dwarf star GJ 887 is only 11 light-years away from Earth 
and has at least two planets orbiting it. The potential location 
of a third planet, which needs confirmation, may lie in the 

habitable zone, where liquid water is stable on the surface. 


Habitable 
zone 


0.2 


Known exoplanet 


which they formed. As planets move in- 
ward, interactions between them can act to 
tune the ratio of their orbital periods, for 
example, leading to a 2:1 ratio where one 
planet goes around once in the time it takes 
the other to go around twice (4). 

Planetary systems have been seen with 
neighboring planets in or close to these so- 
called orbital resonances. The TRAPPIST-1 
system is a classic example of a set of plan- 
ets in a resonance chain where all neighbor- 
ing planets have periods with simple integer 
ratios (5). Planetary systems also appear 
to be full, with no room left for additional 
planets (6). The tugging and pulling of neigh- 
boring planets can act to change planetary 
orbits. This effect is damped out by the 
protoplanetary gaseous disk. Once the disk 
is evaporated by the host star, the planetary 
system might become unstable, leading to 
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planet scattering and collisions that leave a 
system with the planets no longer in reso- 
nance chains (7). This scenario may well have 
happened for GJ 887. 

The potential for life on any of the plan- 
ets can be evaluated by looking at the likely 
surface conditions. The inner planets are 
probably too close to their host star, receiv- 
ing about 2.5 and 8 times more energy from 
their host star than we receive from the Sun 
on Earth. However, red dwarfs are a good 
deal fainter than our own Sun, and the ten- 
tative third planet with a period of ~50 days 
could be in the so-called habitable zone, 
where the surface temperature is suitable 
to allow liquid water (8). 

Solar flares and coronal mass ejections 
pose a serious threat to our life on Earth. For 
example, they could disrupt the electrical 
power supply grid. But serious events 
are rare, with the largest recent event 
striking Earth in 1859. Because red 
dwarfs have a higher magnetic activ- 
ity, they have a much higher frequency 
of flares (9). Their flares can be lethal 
to life on planets in close orbit. But GJ 
887 is a relatively inactive red dwarf, 
which makes the system particularly 
interesting. If someone had to live 
around a red dwarf, they would want 
to choose a quieter star like GJ 887. 

GJ 887 is also attractive to observe 
because, at a distance of only ~11 light- 
years, it is one of the closest planetary 
systems to Earth. This opens the pros- 
pect of atmospheric study of its plan- 
ets by looking for reflected light from 
them using the JWST. 

These types of observations could 
tell us about the atmospheric makeup 
of these planets (10-12). If further ob- 
servations confirm the presence of 
the third planet in the habitable zone, 
then GJ 887 could become one of the 
most studied planetary systems in the Solar 
neighborhood. 
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POLICY FORUM 


MACHINE LEARNING 


“Explaining” machine learning 
reveals policy challenges 


The need to make objectives explicit may expose policy 
trade-offs that had previously been implicit and obscured 
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By Diane Coyle! and Adrian Weller? 


here is a growing demand to be able to 
“explain” machine learning (ML) sys- 
tems’ decisions and actions to human 
users, particularly when used in con- 
texts where decisions have substantial 
implications for those affected and 
where there is a requirement for 
political accountability or legal 
compliance (1). Explainability 
is often discussed as a technical 
challenge in designing ML sys- 
tems and decision procedures, 
to improve understanding of 
what is typically a “black box” 
phenomenon. But some of the 
most difficult challenges are 
nontechnical and raise ques- 
tions about the broader ac- 
countability of organizations 
using ML in their decision-mak- 
ing. One reason for this is that 
many decisions by ML systems 
may exhibit bias, as systemic 
biases in society lead to biases 
in data used by the systems (2). 
But there is another reason, less 
widely appreciated. Because the 
quantities that ML systems seek to optimize 
have to be specified by their users, explain- 
able ML will force policy-makers to be more 
explicit about their objectives, and thus 
about their values and political choices, ex- 
posing policy trade-offs that may have pre- 
viously only been implicit and obscured. As 
the use of ML in policy spreads, there may 
have to be public debate that makes explicit 
the value judgments or weights to be used. 
Merely technical approaches to “explaining” 
ML will often only be effective if the systems 
are deployed by trustworthy and account- 
able organizations. 
The promise of ML is that it could lead 
to better decisions, yet concerns have been 
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raised about its use in policy contexts such 
as criminal justice and policing. A funda- 
mental element of the demand for explain- 
ability is for explanation of what the system 
is “trying to achieve.” Most policy decision- 
making makes extensive use of constructive 
ambiguity to pursue shared objectives with 
sufficient political consensus. There is thus a 


Will the demand for explainable ML systems for decisions in the justice system 
require more explanations from policy-makers, too? 


tension between political or policy decisions, 
which trade off multiple (often incommensu- 
rable) aims and interests, and ML, typically 
a utilitarian maximizer of what is ultimately 
a single quantity and which typically entails 
explicit weighting of decision criteria. 

We focus on public policy decision-mak- 
ing using ML algorithms that learn the re- 
lationships between data inputs and deci- 
sion outputs. As a first step, policy-makers 
need to decide among a number of possible 
meanings of explainability. These range 
from causal accounts and post hoc inter- 
pretations of decisions (3) to assurance that 
outcomes are reliable or fair in terms of the 
specified objectives for the system (4). For 
example, the explainability requirements 
for ML systems used by local authorities 
to determine benefit payments will differ 
greatly from those required for the enforce- 
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ment of competition policy with respect to 
pricing by online merchants. Each of the 
specific meanings of explainability has dif- 
ferent technical requirements, which will 
imply choices about where efficiency and 
cost might need to be sacrificed to deliver 
both explainability and the desired out- 
comes. Choosing which meaning is relevant 
is far from a technical question (though 
what can be provided depends on what is 
technically feasible). Thus, those seeking 
explainability will need to specify, in terms 
translatable to how ML systems operate, 
what exactly they mean, and what kind of 
evidence would satisfy their demand (5). It 
must also be possible to monitor whatever 
explanations are provided, and there must 
be practical methods to enforce compliance. 

Furthermore, policy institutions starting 
to deploy algorithmic or ML-based deci- 
sion systems, such as the police, courts, and 
government agencies, are operating in the 
context of declining trust in some aspects 
of public life. This context is 
important for understanding 
demands for explainability, 
as these may in part reflect 
broader legitimacy demands of 
the policy-making process. If an 
organization is not trusted, its 
automated decision procedures 
will likely also be distrusted. 
This implies a broader need for 
trustworthy processes and insti- 
tutions, for “intelligent account- 
ability” as the result of informed 
and independent scrutiny, com- 
municated clearly to the public 
(6). Satisfying the demand for 
explainability implies testing 
the trustworthiness of the or- 
ganizations using ML systems 
to make decisions affecting in- 
dividuals. Evaluation requires 
comparing outcomes against a benchmark, 
which can be the baseline situation, or a 
specified desired outcome. 

Taking the demand for explainability as 
a demand for accountability, the promise of 
ML is that it could lead to more legitimate 
and better decisions than humans can make, 
on some measure. Potential benefits are 
clearly demonstrable in some forms of medi- 
cal diagnosis (7) or monitoring attempted 
financial fraud (8). In these domains, there 
is general agreement on a straightforward 
quantity to optimize, and the incentives of 
principals (citizens or customers) and agents 
(public or corporate decision-makers) are 
aligned. Public concern about the use of ML 
focuses on other domains, such as marketing 
or policing, where there may be less agree- 
ment about (or trust in) the aim of either the 
ML system or the organization using it. 
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These concerns highlight a key chal- 
lenge posed by the use of ML in policy 
decisions, which is that ML processes are 
almost always set up to optimize an objec- 
tive function; this optimization goal can 
be described in anthropomorphic terms 
as the “intention” of the system. Yet there 
is often little or no explicit discussion by 
policy-makers when considering using ML 
systems about what conflicting goals, ben- 
efits, and risks may trade off against each 
other as a result. One reason for this is 
that it is inherently challenging to specify 
a concrete objective function in sociopo- 
litical domains (9). For example, like cur- 
rent ML systems, economists’ decisions are 
informed by estimates of statistical rela- 
tionships between directly observable and 
unobservable variables, derived from data 
generated by a complex environment. Yet 
economic policies such as tax changes often 
fail to take into account all relevant factors 
in the decision environment, or likely be- 
havior changes, in specifying the objective 
function (10). The use of ML systems in 
other policy contexts will expand the scope 
of such unintended consequences. 

Given that the dominant paradigm of ma- 
chine learning is based on optimization, the 
use of ML in policy decisions thus speaks to 
a fundamental debate about social welfare. 
From the perspective of ethical theories, ML 
is largely consequentialist: A machine sys- 
tem is configured on the basis of its ability 
to achieve a desired outcome. Conventional 
policy analysis is similarly typically based 
on consequentialist economic social welfare 
criteria. The well-known impossibility theo- 
rems in social choice theory (17) establish 
that when the goal is to aggregate individ- 
ual choices under a set of reasonable social 
decision rules, it is impossible to satisfy a 
set of desirable criteria simultaneously, and 
thus impossible to achieve a set of desired 
outcomes by optimizing a single quantity. 
Critics of consequentialist economic policy 
analysis argue that people have multidi- 
mensional, probably incommensurable, and 
possibly contradictory objectives, so that 
imposing utilitarian decision-making pro- 
cedures will conflict both with reality and 
with ethical intuitions (72). 

Nevertheless, policy choices are made, 
so there has always been an unavoidable, 
albeit often implicit, trade-off or weighting 
of different objectives (12). For example, 
cost-benefit analysis can incorporate envi- 
ronmental and cultural, as well as financial, 
considerations, but converts all of these into 
monetary values. Any choice made when 
there are multiple interests or trade-offs will 
imply weights on the different components. 
As these trade-offs are codified into ML ob- 
jective functions, the weights given to com- 
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peting objectives comprise a first-line char- 
acterization of how conflicts will be resolved. 
Using ML systems in political contexts is ex- 
tending the use of optimization; progress in 
making these ML systems more understand- 
able to policy-makers will make the de facto 
choices between competing objectives more 
explicit than they have been previously (13). 
Greater explainability is therefore likely to 
have to lead to a more explicit political, not 
wholly technical, debate. 

Distilling concrete, unambiguous  ob- 
jectives in this way may turn out to be ex- 
tremely challenging, for ambiguity about 
objectives is often useful in policy-making 
precisely because it blurs uncomfortable 
conflicts of interest. In many domains, poli- 
cies generally emerge as a pragmatic com- 
promise between fundamentally conflicting 
aims. For example, people who disagree 
about whether the justice system should be 
retributive or rehabilitative may well be able 
to agree on specific sentencing policies. Such 
incompletely theorized agreements “Play an 
important function in any well-functioning 
democracy consisting of a heterogeneous 
population” (74, p. 1738). The omission of dis- 
cussion of ultimate aims can make it easier 
to achieve consensus on difficult issues. As 
there is some (limited) scope to interpret 
means to achieve the objective with flexibil- 
ity, the “weighting” of different fundamental 
aims remains implicit, and diverse political 
communities can make progress. 

An optimistic conclusion would be that 
being forced by the use of ML systems to 
be more explicit about policy objectives 
could promote useful debate leading in 
the long run to more considered outcomes. 
ML systems can be used to explore choices 
and outcomes on different counterfactual 
high-level objectives, such as retribution or 
rehabilitation in justice, enabling consid- 
ered human judgments. However, it may 
in practice be impossible to specify what 
we collectively truly want in rigid code. For 
example, many local governments do not 
seem to be engaging in public consultation 
when they adopt predictive ML systems, 
such as to flag “troubled” families that are 
likely to need interventions. Although steps 
such as explicitly adding uncertainty to 
the ML objective might address this chal- 
lenge of imperfectly specified objectives in 
future, ML systems are unable at present 
to offer wisely moderated solutions to am- 
biguous objectives (15). 

Human decision-makers can make use of 
common sense or tacit knowledge, and of- 
ten override decisions indicated by an eco- 
nomic model or other formal policy analysis, 
and they will be able to do the same when 
assisted by ML. Yet, demanding that ML 
systems be explainable is likely to make the 
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trade-offs between different objectives far 
more explicit than has been the norm pre- 
viously. Ultimately, the use of explainable 
ML systems in the public sector will make 
a broader debate about social objectives 
and social justice newly salient. Providing 
explanations requires being transparent 
about the systems’ objectives — forcing clar- 
ity about choices and trade-offs previously 
often made implicitly — and how their pre- 
dictions or decisions draw on patterns re- 
vealed by a fundamentally biased social and 
institutional system. Moreover, whereas 
democratic political systems often look to 
resolve conflicts through constructive am- 
biguity—or in other words, the failure to ex- 
plain—ML systems may require ambiguous 
objectives to be resolved unequivocally. So, 
although the need for explainability cer- 
tainly poses technical challenges, it poses 
political challenges too, which have not to 
date been widely acknowledged. Yet, the 
increasing scope of ML, and progress in 
delivering explainability, in politically sa- 
lient areas of policy could shine a helpful 
spotlight on the conflicting aims and the 
implicit trade-offs in policy decisions, just 
as it already has on the biases in existing 
social and economic systems. 
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PSYCHOLOGY 


Confronting bias 


Prejudice and discrimination are deeply personal, 


but their effects are profound 


By Calvin K. Lai 


n an idyllic summer day in 2009, 
Pragya Agarwal and her 9-year-old 
daughter went shopping for a new 
school uniform. As they were walk- 
ing back to their car, an armed po- 
lice officer stopped them. The officer 
told them that a customer had reported 
them as people who “looked like shoplift- 
ers” and were “suspicious.” Agarwal and her 
daughter were eventually allowed to leave, 
but the consequences of that incident were 
enduring. Experiences like this underscore 
the prevalence of group-based biases that 
are the focus of Agarwal’s new book, Sway. 
She begins by describing the origins of 
biases. In everyday life, we often have ei- 
ther too little or too much information to 
make optimal decisions. Our biases serve as 
mental shortcuts that help us make “good 
enough” decisions in a complex world. 
From an evolutionary perspective, this abil- 
ity to quickly separate good from bad and 
friend from foe was essential for survival. 
These biases, however, are prone to system- 
atic errors that can have grave consequences 
in modern society. Take our preference for 
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similarity: We seek out information that con- 
firms views we already hold, prefer to live 
with people who are like us, and discriminate 
against those who do not share our views. 
We also prefer the status quo, meaning that 
we are biased toward current conditions, 
even when they reinforce the 
oppression of a minority group. 

Agarwal documents biases 
across many social distinctions. 
She notes the double standards 
to which women in leadership are 
held, health care workers’ beliefs 
that Black individuals experience 
less severe pain than non-Black 
individuals, and the insidious ste- 
reotype of Asian people as “model 


Unraveling 


uncenscious bias 


Pragya Agarwal 


Sway: Unravelling 
Unconscious Bias 


Awoman speaks to security personnel in Washington, 
D.C., on 3 June 2020. Biases can affect how individual 
law enforcement officers interact with citizens. 


that goes beyond its original meaning (7). She 
assumes that many social disparities are nec- 
essarily caused by unconscious biases. How- 
ever, many of the disparities she describes 
are better explained by structural causes 
or conscious prejudices. She writes, for ex- 
ample, that “implicit aversive attitudes” are 
an explanation for racial housing segrega- 
tion. Racial housing segregation can often 
be more parsimoniously explained by insti- 
tutional forces, such as a history of redlin- 
ing, exclusionary zoning regulations, and 
economic inequalities. When psychologi- 
cal biases are considered, there is clearer 
evidence of real estate agents consciously 
steering homebuyers to certain neighbor- 
hoods on the basis of the homebuyers’ race 
rather than as a result of unconscious bias. 
Another form of concept creep is in the 
use of the term “unconscious” to encompass 
biases that are subtle but conscious. Some 
may just be hidden (for example, an avowed 
racist who lies about the reasons for treating 
a racial minority poorly), or they may be fast 
to arise (for example, a visceral reaction to 
a person who speaks with a foreign accent). 
In the book’s final chapter, Agarwal de- 
scribes approaches for debiasing, including 
raising awareness of bias, confronting oth- 
ers tactfully, removing biasing information 
from job applications, perspective-taking, 
and role models. Many of these approaches 
have firm scientific backing and a track 
record of success in reducing prejudice 
and discrimination. But most 
of what she offers is psychologi- 
cally and individually oriented. 
If the disparities Agarwal dis- 
cusses have structural causes, 
then these individualistic ap- 
proaches may fall short. 
Promoting equality takes time, 
effort, and systemic change. The 
enduring influence of structural 
factors, such as unequal educa- 


minorities.”” She describes how oe tion and wealth, creates social 
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how elderly people are perceived 
as burdens, and how accents can confer a 
host of hidden stereotypes. Although her ap- 
proach is exhaustive, it falters in its synthesis. 

Research tells us that prejudice, stereotyp- 
ing, and discrimination are pervasive. Less 
attention is paid to how to connect these 
findings together. What common psychologi- 
cal principles underlie the prevalence of un- 
consciously biased discrimination? 

Isuspect that Agarwal’s lack of emphasis on 
common principles is due to concept creep: 
an expansion of the idea of unconscious bias 
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escaped. At the same time, struc- 
tural inequalities depend on the countless 
individual choices that people make for oth- 
ers in everyday life. An integrated perspec- 
tive that deeply considers the relationship 
between individuals and society would have 
helped illustrate a key lesson of research on 
inequality: Bias is deeply personal, but it is 
also universal. 
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Like other species, we have always been on the move 


Human migration and cultural exchange represent progress, not peril, argues a journalist 


By Khalid Koser 


tis ironic to be reviewing Sonia Shah’s 
latest book, a thoughtful and thought- 
provoking defense of migration, under 
coronavirus disease 2019 (COVID-19) 
lockdown. One irony is that this is the 
perfect time to be reading her excellent 
earlier book, Pandemic: Tracking Conta- 
gions, from Cholera to Ebola and Beyond. 
Another is that migration—at least by 
people and across borders—has virtually 
come to a standstill. But this is no reason 
to shelve The Next Great Migration: The 
Beauty and Terror of Life on the Move. In- 
ternational migration will rebound. 

Shah would probably contend that this 
is because people have never been fixed 
in place and that their migration is natu- 
ral. One of the more original aspects of 
her book is that it depicts human migra- 
tion in the context of migration by plants, 
animals, and even viruses. My analysis is 
more prosaic. I believe that the forces of 
globalization that drive migration—in- 
equality; disparities in development, de- 
mocracy, and demography; the global jobs 
crisis; segmented labor markets—will be 
exacerbated by COVID-19 and government 
responses to it. 

I also disagree with the book’s premise 
that, the current global standstill notwith- 
standing, climate change will trigger a new 
great migration. Most sensible scholars 
agree, of course, that the effects of climate 
change will accelerate migration. But there 
remains little consensus concerning how 
many people will be forced to move, where 
they will move from and to, or how soon 
this migration will happen. But whatever 
the numbers, Shah is right to assert that 
we need to start respecting the rights of 
migrants and seizing the opportunities 
of migration. 

Some of the book’s most powerful sec- 
tions reflect on the author’s own migrant 
heritage: Fifty years after her parents’ ar- 
rival in the United States, she recounts, 
their migration remains the central fact 
of their lives. She also shares the stories 
of migrants whom she has met around 
the world in the course of her research. 


The reviewer is at the Maastricht Graduate School 
of Governance, Maastricht University, 6211 AX Maastricht, 
Netherlands. Email: k.koser@gcerf.org 
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She speaks, for example, with individu- 
als who have fled Afghanistan and Eritrea 
and taken the precarious journey across 
mountain ranges, deserts, and the sea to 
Europe, as well as with Latin American 
migrants who have made it to the United 
States without authorization. Shah does 
not overdramatize or sentimentalize their 
stories. Instead, the reader is left with an 
overwhelming sense that people who mi- 
grate out of desperation are resigned to 
experiencing great hardship as a means to 
improve their lives. 

Shah seeks to normalize human migra- 
tion by situating it within the broader con- 
text of other species’ migratory behavior. 
Humans, she contends, do not belong in a 
particular place any more than do butter- 
flies, fish, or trees. She demonstrates how 
migration has been integral to the evolu- 
tion of humankind, by injecting change 
and innovation into cultural practices. The 
mixing of people from different places rep- 
resents progress, not peril. 

Shah methodically dismantles the racial 
“science” that still underlies certain atti- 
tudes toward those who migrate and re- 
jects arguments for controlling migration 
on the grounds that it could potentially 
lead to overpopulation, an idea that origi- 
nated in the work of Thomas Malthus. It 
is not particularly groundbreaking to take 
such attitudes and assumptions to task, 
but Shah does so in an engaging manner, 
weaving history with geography and story- 
telling with science. 


THE NEXT 
GREAT 
MIGRATION 
Re. 


The Next Great Migration: 
The Beauty and Terror of 
Life on the Move 

Sonia Shah 

Bloomsbury, 2020. 400 pp. 
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Still, I think Shah overestimates the 
importance of such ideas and beliefs in 
explaining the xenophobia, discrimina- 
tion, and antimigrant sentiment and poli- 
cies that abound today, which I believe 
are more driven by perceptions about the 
economic impact and security implica- 
tions of immigration. I do, however, agree 
with her pessimistic prognosis that unless 
we shift attitudes, our default response to 
more migration will be to build more walls 
and enact more stringent border controls. 
Her closing chapter is a grim litany of mi- 
grant deaths and the detention of immi- 
grant children. 

Countless scholars, analysts, and re- 
searchers have produced evidence that 
migration overwhelmingly benefits econ- 
omies and societies and that there is a 
higher rate of criminality and violent ex- 
tremism among nationals than among 
migrants, but seemingly to little effect. 
Perhaps Shah’s more fundamental plea— 
that migration is normal, that we are all 
migrants, and that, like nature, migration 
can be both beautiful and terrifying—will 
have more traction. ! 
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Humankind: 

A Hopeful History 

Rutger Bregman 

Little, Brown, 2020. 480 pp. 


Difficult as it might be to 
believe, humans are hardwired 
for kindness, cooperation, 
and trust, argues historian 
Rutger Bregman in his latest 
book. This week on the 
Science podcast, Bregman 
discusses the evolutionary 
origins of humanity's good- 
ness and its implications for 
our future. 


10.1126/science.abd0367 
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The COVID-19 pandemic has revealed vulnerabilities in waste management chains, which could hinder disease containment and increase environmental pollution. 


Edited by Jennifer Sills 


COVID-19’s unsustainable 
waste management 


The coronavirus disease 2019 (COVID-19) 
pandemic has led to an abrupt collapse of 
waste management chains. Safely manag- 
ing medical and domestic waste is crucial 
to successfully containing the disease (J). 
Mismanagement can also lead to increased 
environmental pollution. All countries 
facing excess waste should evaluate their 
management systems to incorporate disas- 
ter preparedness and resilience. 

Wuhan, the COVID-19 epicenter of China, 
experienced a massive increase of medical 
waste from between 40 and 50 tons/day 
before the outbreak to about 247 tons on 
1 March (2). Cities such as Manila, Kuala 
Lumpur, Hanoi, and Bangkok experienced 
similar increases, producing 154 to 280 tons 
more medical waste per day than before the 
pandemic (3). Meanwhile, the widespread 
lockdown has caused a substantial increase 
in domestic waste in the United Kingdom 
(4). These large amounts of waste require 
collection and recycling, both of which 
are compromised as a result of manpower 
shortages and efforts to enforce infection 
control measures (5, 6). 

Disrupted services have led to waste 
mismanagement increases of 300% in 
some rural UK communities (7). With 
fewer options available, traditional waste 
management practices such as landfills 
and incineration are replacing more 
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sustainable measures such as recycling, 
with adverse effects on the environment 
(8). The U.K. Environment Agency further 
threatens the environment by allowing 
temporary storage of waste and incinera- 
tion ash at sites that have not been granted 
a permit, as is usually required (9, 10). 

To address the overflow of medical 
waste, the United Kingdom and other 
affected countries should install mobile 
treatment systems near hospitals and 
health care centers (2). The design and 
analysis of sustainable waste management 
chains, including logistics, recycling, and 
treatment technologies and policies, should 
be prioritized (17). To reduce the socio- 
economic and environmental impacts of 
waste management, the whole system must 
be considered, including waste generation, 
collection, transport, recycling and treat- 
ment, recovered resource use, and disposal 
of remains. Protecting waste management 
chains will help achieve sustainable cities 
and communities as outlined in the UN 
Sustainable Development Goals (72). 
Siming You", Christian Sonne’, Yong Sik Ok*** 
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Misguided forest action in 
EU Biodiversity Strategy 


After failing to achieve the 2020 Aichi 
biodiversity targets, governing bodies are 
preparing targets for the future. In doing 
so, they must acknowledge that effective 
policies address not only the quantity of 
protected areas but also the quality of 
protection and the management of the 
surrounding matrix. The European Union’s 
Biodiversity Strategy for 2030 (1), launched 
in May, is an opportunity to enact success- 
ful conservation. The Biodiversity Strategy 
commits to protecting the European Union’s 
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remaining forests and restoring forest 
ecosystem resilience. However, the plans for 
implementing these goals are misguided. 

The EU Strategy commits to “strictly pro- 
tecting...primary and old-growth forests.” 
Known primary forests represent 0.7% of 
Europe’s forest area, of which only 46% 
are strictly protected (2). Strictly protect- 
ing the remaining forests is the right 
strategy, given that forest management 
seriously threatens remaining primeval 
and old forests in Europe, including the 
Biatowieza Forest, Carpathian forests, and 
Scandinavian Mountains Green Belt (3, 4). 
However, the EU Strategy overlooks the 
need for spatial coherence. Tiny protected 
patches of forest will be of little relevance if 
intensive forestry prevails in the surround- 
ing matrix. An ecosystem-based approach 
to forest management must complement 
protection efforts. 

The EU Strategy also commits to restor- 
ing forest ecosystems but only offers 
planting more than 3 billion trees as a 
concrete action. Planting trees in forest 
habitats may have detrimental effects (5, 
6). The EU Strategy doesn’t specify what 
kinds of trees should be planted, a neces- 
sary detail given that the forestry sector in 
most countries increasingly undermines 
ecosystem functionality by introducing 
exotic species believed to be better adapted 
to future climates (7). 

Instead of planting trees, conservation- 
ists should focus on reducing the rate of 
forest degradation (namely, tree harvesting) 
and supporting natural renewal processes. 
The EU Strategy does not consider natural 
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restoration of spontaneously regenerating 
forests, the surface of which has decreased 
7% worldwide over the past 30 years (8). 
Simply allowing the forests to naturally 
regrow often results in more trees at much 
lower costs than planting (6). The EU 
Strategy should advocate a hands-off strat- 
egy for safeguarding ecosystemic responses 
to degradation and environmental change. 
Policies should support the natural recruit- 
ment and selection for trees with greater 
resistance to insect attacks or extreme 
events (9) and biomass-rich forests with 
closed canopies (0), and they should pre- 
vent road construction through valuable 
forest patches (77). Actions aimed at pro- 
moting the adaptation of forests to global 
change and increasing their resilience 
should be based on all available ecological 
science and require a more complex view 
than a simplistic planting strategy. The EU 
Forest Strategy planned for 2021 should 
focus on developing a holistic approach 
with a clearly defined timetable of actions. 
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Afforestation falls short 
as a biodiversity strategy 


The recent EU Biodiversity Strategy for 
2030 (1) recognizes the importance of 
biodiversity for increasing our resilience 
to natural disasters and pandemics and, 
thus, for human well-being. Although 
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it proposes ambitious measures such as 
reversing pollinator decline and controlling 
invasive species, it also introduces the ill- 
advised idea of planting 3 billion trees. 

Massive tree plantation programs (2, 3) 
have been strongly criticized by the scientific 
community for their negative ecological and 
economic impacts and their limited role in 
climate change and CO, mitigation (4-8). 
The specific number of trees proposed in 
the EU Strategy suggests a lack of a serious, 
science-based ecological assessment of actual 
restoration needs. Meeting such a target 
could threaten biodiverse treeless ecosystems 
(4, 6, 7, 9) and would waste an opportunity 
to implement ecologically sound manage- 
ment practices to restore fully functionally 
integrated mosaics of natural, seminatural, 
and sustainable agricultural ecosystems. 

Massive tree planting could also sub- 
stantially change the fire regime, especially 
given the increasing frequency of heat 
waves and droughts in an area with high 
population density (70). The probability of 
large intense fires that threaten biodiver- 
sity and human assets is largely influenced 
by the type, amount, and continuity of 
biomass. Therefore, determining how many 
trees should be planted is less important 
than figuring out the most safe and effec- 
tive conservation strategy. 

We need to move toward optimizing our 
landscapes. A diverse mosaic of nature- 
based production systems should be 
interspersed with protected natural areas 
to maximize biodiversity, resilience, and 
ecosystem services. Trees are not synony- 
mous with biodiversity. Policy-makers and 
society need to internalize this message to 
make proper decisions in the context of 
environmental and health crises. 
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A view from my basement 
AAAS CEO reflects on his first 6 months on the job 


By Sudip S. Parikh 


On 6 January, when | first walked through the doors of our beauti- 
ful living monument to science on New York Avenue in downtown 
Washington as the American Association for the Advancement of 
Science’s new chief executive officer and executive publisher of the 
Science family of journals, | had plenty of ideas for how my first 90 
days, 6 months, and year on the job would go. 

| articulated this vision on the editorial page of the 31 January 
issue of Science in the context of “Envisioning Tomorrow's Earth,” 
the theme of our 2020 Annual Meeting. | noted the extraordinary 
advances of our era, including treating and curing disease and deep- 
ening our understanding of the Universe. | also identified some of 
the critical topics of our time in prioritizing the need to safeguard the 
scientific ecosystem for diverse voices and inclusivity and leverage 
the power of all great minds. 

At the time that editorial went to press, the United States had 
fewer than 10 confirmed cases of COVID-19 and George Floyd, 
Ahmaud Arbery, and Breonna Taylor were alive. 

When we gathered in Seattle in February to envision tomorrow's 
Earth, most of us had no idea how the tomorrows we were about 
to experience would change our Earth forever. COVID-19 is nowa 
global pandemic that has infected more than 7 million people, killed 
more than 400,000, and fundamentally changed the way the world 
operates. The tragic and needless deaths of George Floyd, Anmaud 
Arbery, and Breonna Taylor have brought to the fore the need for 
decisive and lasting action to declare the importance of Black lives, 
voices, and contributions to all aspects of our society—and, central 
to our work at AAAS, to the STEM ecosystem. 
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Parikh calls for decisive and lasting action 
to declare the importance of Black lives, voices, 
and contributions to all aspects of our society. 


Almost every generation has arrived at 
similar moments of opportunity born of 
tragedy. AAAS is uniquely positioned as an 
organization to be at the forefront of these 
generation-defining issues, and we are tak- 
ing action to make sure these moments do 
not slip away. 

Science, engineering, and medicine are not 
immune to the discrimination, subjugation, 
and silencing of minority people and voices. 
We are too often unwitting perpetuators of 
the status quo, and the reasons are deeply 
ingrained in the systems that govern the con- 
duct of these fields. When we hold up a mirror 
to the scientific enterprise, we see that it is 
not only politicians and law enforcement that 
need to be reminded that Black lives matter. 

On 10 June, AAAS participated in a grass- 
roots movement called #ShutDownSTEM 
with the goal to get our community to stop 
business as usual and consider the facts— 
and, with those facts in hand, to act. Despite 
so many in our community being learned and 
well-read, outside of the social science circles 
that have studied this issue for years, few of 
us know very much about the systemic rac- 
ism that has kept generations of Black Ameri- 
cans from realizing success in the STEM 
workforce. #ShutDownSTEM encouraged 
us to take responsibility in our own lives and 
circles of influence to be actively anti-racist 
and to recognize the time and space required 
for our Black colleagues to heal. 

#ShutDownSTENM is just one step toward following through on 
fulfilling the hope for positive change in our community and across 
society. Other steps are already under way within AAAS, and many 
more have yet to be conceived. 

For decades, under the leadership of Shirley Malcom, AAAS has 
been at the forefront of the discussion about making the STEM enter- 
prise more diverse, equitable, and inclusive. As an African American 
growing up in Birmingham, Alabama, in the 1950s and 1960s, Malcom 
went through segregated schools that were underresourced because 
of the assumption that no scientists or engineers would ever be pro- 
duced from within their walls. She defied those odds, eventually com- 
pleting her doctoral work in ecology at Penn State, dedicating her life’s 
work to achieving equality in STEM education and the workforce, and 
going on to serve as a member of the National Science Board, a trustee 
of two of our nation’s most prestigious research universities, and a 
world-renowned leader in the conversation that we've been having for 
quite some time—but to which too few have paid sufficient attention. 

Progress to increase the participation and advancement of un- 
derrepresented groups in STEM has been incremental. For the past 
10 years, AAAS, with the support of the U.S. National Science Foun- 
dation (NSF), has convened the Emerging Researchers National 
(ERN) Conference in STEM. Each year, more than a thousand un- 
dergraduate and graduate students from underrepresented com- 
munities come together to share their research and develop their 
careers—with the goal of broadening participation in STEM fields. 
In describing the kind of experience that ERN attendees have, 
Malcom has said, “For some students it’s their first time on a plane, 
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never mind their first time presenting at a scientific meeting.” 

For the past 3 years AAAS has convened, also with NSF support, the 
HBCU Making and Innovation Showcase. The event brings together 
more than 80 students and faculty from historically Black colleges and 
universities (HBCUs) for 2 days of workshops and training on inven- 
tion and entrepreneurship. The students are divided into teams that 
create an innovative solution to a community problem that relates to 
one of the United Nations Sustainable Development Goals. This year’s 
winning team, comprised of students from Clark Atlanta University and 
Morehouse College, conceived a network of communication devices for 
use during natural disasters and other emergency situations. 

Convenings like ERN and the HBCU Making and Innovation 
Showcase have surely helped. Bit by bit, person by person, they 
have encouraged many students and professionals who might have 
otherwise abandoned STEM careers to stay the course, knowing 
that their voices and contributions are valued and essential to the 
long-term success of the STEM enterprise. As Malcom posited in her 
9 May 2019 testimony before the U.S. House Committee on Science, 
Space, and Technology: “How do we ensure a steady flow of talent for 
STEM while also responding to the larger need for a workforce and 
citizenry with knowledge and skills to address emerging challenges 
and opportunities? We can only do this by expanding that pool of 
talent, tapping into the vast well of women, minorities, and persons 
with disabilities currently underrepresented in STEM.” 

We must acknowledge that our efforts thus far have fallen short of 
what is truly necessary: systemic change that transforms institu- 
tions—not just individuals. We must tackle the issue where it is most 
oppressive: deeply ingrained institutional systems. Through AAAS's 
SEA (STEM Equity Achievement) Change program, institutions of 
higher education commit to a self-reflection process with the aim 
of disentangling themselves from practices of the past that made 
inequities possible—indeed, almost inevitable. The program incentiv- 
izes institutions’ alignment with SEA Change principles by publicly 
recognizing them for their commitment to and creation of sustain- 
able systemic change through self-assessment. “It’s a transformative 
national vision,” said Paula Rayman, a sociologist at the University of 
Massachusetts, Lowell, who chairs the SEA Change advisory board. 
Now more than ever, we must embrace transformative national vi- 
sions over piecemeal, individual-focused interventions. 

The rapid response of AAAS and Science to the COVID-19 pan- 
demic also has been notable. 

Science and its family of journals provide credible, evidence-based 
information, share the latest research, and disseminate up-to- 
the-minute, science-informed news coverage. Our editorial team 
continues to deliver seminal papers showing how the structure of 
the coronavirus informs vaccine development and how the virus 
bonds to human cells, exploring the beginning of new therapies, and 
examining how the public health system and social distancing can 
mitigate the spread of COVID-19. Journalists at Science are covering 
the science and responses to the pandemic around the world, often 
highlighting aspects that are picked up by mainstream news outlets. 

Among AAAS programs focusing on COVID-19 is SciLine, which 
connects journalists with vetted scientific experts. SciLine began 
2020, its second full year of operation, with plans that included two 
“boot camps”—one to help journalists understand the science be- 
hind key electoral issues and another for reporters covering adoles- 
cent health. In the space of about 2 weeks, the SciLine team quickly 
pivoted to organizing a series of online media briefings for journalists 
on COVID-19 and developed a resources webpage to provide ready- 
to-use quotes from scientists. 

SciLine also enlisted Margaret Hamburg, past president of AAAS, 
former commissioner of the U.S. Food and Drug Administration, and 
former health commissioner of New York City. On one day in April, 
from her home office, Hamburg conducted interviews with 20 local TV 
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stations and 13 local radio stations across the country. In doing so, she 
reached nearly 1.6 million people in 17 states with timely, accurate, and 
authoritative scientific and public health information. 

AAAS's Science and Technology Policy Fellowships program has 
adopted virtual platforms to connect with current policy fellows and 
to open doors for the incoming cohort of scientists and engineers. 
The program, which places some 280 scientists and engineers 
in federal agencies and congressional offices each year, places a 
high value on in-person professional development and networking, 
switching to virtual convenings at a time when the need for fellows to 
provide scientific and technical advice is greater than ever. The team 
recently conducted thousands of interviews virtually for the incom- 
ing class and is preparing to host a virtual orientation in the fall. 


For the team that produces the 
“#ShutDownSTEM is bia Annual Meeting, moving from 
. person to virtual convenings has 
just one step toward been top of mind. After announc- 

. ing that the February 2021 meeting 
following through would be a virtual event, the team 
on fulfilling the hope has not looked back, rolling out an 
for positive change 

in our community 
and across society.” 


entirely new format for scientific 

symposia that serves as a model for 

other organizations. 

The Science for Seminaries 

program, part of our Dialogue on 

Science, Ethics, and Religion, which 
Sudip S. Parikh, assists theologically diverse seminar- 
AAAS CEO ies to integrate science into their re- 
quired courses of study, also adjusted 
the focus of one of its recent discussions, shifting it to an examina- 
tion of the science behind the spread of COVID-19 when audiences 
sing or speak in loud voices, such as at worship services. 

These initiatives display the amazing number of influential audi- 
ences we reach as a scientific society. Researchers, policy-makers, 
journalists, science communicators, students, seminarians—all of 
these groups play critical roles in our mission to advance science 
and serve society. 

| have not stepped foot in AAAS headquarters in downtown Wash- 
ington since 13 March—which means | have now spent more time lead- 
ing AAAS from my basement or my kids’ playroom than | have from 
my office. Even so, | am more confident than ever in the vitality of our 
mission and in those working daily to execute that mission. We will be 
a force for science, a force for good, and a support for one another. Our 
programs, publications, and advocacy are critical to a better and more 
just world, and what we do during this time will define a generation. 

My 31 January Science editorial called on us all to rise to the chal- 
lenges of our time to ensure that the next generation has the opportu- 
nity to rise to theirs. On so many levels, those words ring truer today. 


Screeners needed 
for journalism awards 


Scientists from the United States and abroad are needed to review 
the scientific accuracy of entries in the prestigious AAAS Kavli 
Science Journalism Awards competition. The screening sessions in 
late August and September will be online this year, opening them to 
participation by scientists beyond the Washington, D.C., area. 

We need additional screeners with expertise in virology, epidemiol- 
ogy, and public health. If you can volunteer, please contact 

Emily Hughes at ehughes@aaas.org. 
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Easier polar plunge for penguins. 


n polar regions affected by a warming climate, it is increasingly important ite) 
understand what effect yearly variations in sea ice have on animal fore 
| and. reproductive success. Watanabe et al. instrumented 175 Adéli pengu 
a sentinel species for Southern Ocean ecosystems—with activity. monitor: 
and video cameras during four Antarctic field seasons with variable amount 
of ice. During ice-free-years, penguins traveled more-by swimming rather than 


king, low 
creased in 


ig their.energetic cost per unit distance traveled. Forage z area 
-free years, and more krill were captured per unit dive time, — 


increasing foraging efficiency, growth rates, and breeding success. Adéli 
guin pores in the ae Antarctic Fegion are thus likel 


CHROMATIN SEQUENCING 
Primary architecture of 
chromatin fibers 


The organization of chromo- 
somal DNA, including the 
positioning of nucleosomes 
and nucleosome-free regions 
harboring regulatory proteins 
along single chromatin fibers, is 
fundamental to genome func- 
tion. However, most sequencing 
methods cannot elucidate this 
organization at the nucleotide 
level. Stergachis et al. present 
an approach, Fiber-seq, that 
maps chromatin fibers onto 
the underlying DNA template 
using methyltransferases to 
create a kind of stencil in fly 
and human cells. This method 
identifies chromatin structure 
at nearly a single-molecule level 
and can monitor the position of 


SCIENCE sciencemag.org 


nucleosomes. Using Fiber-seq, 
the authors identify how regula- 
tory DNA activation is related 
to nucleosome positioning and 
DNA variation. -—LMZ 

Science, this issue p. 1449 


PLASMONIC MATERIALS 
Groovy chiral gold 
particles 


Although plasmonic optical 
activity can arise from chiral 
assemblies of gold and silver 
nanoparticles, there are few 
examples of gold nanoparticles 
with intrinsic chirality and high 
optical activity. Gonzalez-Rubio 
et al. show that morphological 
chirality can be induced dur- 
ing the seeded growth of gold 
nanoparticles, particularly for 
highly anisotropic nanorods. 


Chiral additives as cosurfactants 
formed helical micelles that 
directed the seeded growth 
to create grooves that main- 
tained a chiral morphology. The 
resulting particles displayed 


Electron tomography 
reconstructions of a gold nanorod 
grown with a chiral surfactant 
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high-intensity circular dichroism 
with anisotropy factors near 

0.2 at near-infrared wavelengths. 
—PDS 


Science, this issue p. 1472 


SOLAR PHYSICS 
The flow of plasma 


inside the Sun 

The activity of the Sun, including 
the occurrence of sunspots, is 
driven by magnetic fields that 
originate from the motion of 
charged plasma beneath the 
surface. Helioseismology uses 
acoustic oscillations to probe 
the Sun's interior, analogous to 
seismology’s use of earthquakes 
to investigate Earth’s interior. 
Gizon et al. analyzed helioseis- 
mology data from 1996 to 2019, 
covering two 11-year solar cycles. 
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They measured the latitudinal 
and radial flow of plasma as 
a function of depth within the 
Sun and how it varies with time. 
The results support magnetic 
flux-transport dynamo models, 
which can explain the distribu- 
tion of sunspots over each solar 
cycle. —KTS 

Science, this issue p. 1469 


Metalens-array-based 
quantum source 


Spontaneous down-conversion 
is an exotic optical process in 
a nonlinear crystal in which a 
high-energy photon splits into 
two lower-energy photons that 
are quantum mechanically 
entangled. These entangled 
pairs are valuable commodi- 
ties for quantum information 
processing and quantum 
communications. Because the 
experimental setup is usually 
performed with bulk optical 
components, there is a need 
to decrease the size scale for 
application. Li et al. combined 
an array of specialized meta- 
lenses with a nonlinear crystal 
and show that the scale of the 
process can be shrunk sub- 
stantially. The approach should 
prove useful for developing min- 
iaturized integrated quantum 
optical technologies. —|SO 
Science, this issue p. 1487 


Who and what next? 

The coronavirus 2019 (COVID- 
19) pandemic has brought 
tighter restrictions on the daily 
lives of millions of people, but 
we do not yet understand what 
measures are the most effec- 
tive. Zhang et al. modeled virus 
transmission in Wuhan, China, in 
February 2020, investigating the 
effects of interventions ranging 
from patient management to 
social isolation. Age-mixing pat- 
terns were estimated by contact 
surveys conducted in Wuhan 
and Shanghai at the beginning 
of February 2020. Once people 
reduced their average daily 
contacts from 14 to 20 down 

to 2, transmission rapidly fell 
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below the epidemic threshold. 
The model also showed that 
preemptive school closures 
helped to reduce transmission, 
although alone they would not 
prevent a COVID-19 outbreak. 
Limiting human mixing to within 
households appeared to be the 
most effective measure. —CA 
Science, this issue p. 1481 


Awrench in the works 
of COVID-19 


Understanding the inner work- 
ings of the virus that causes 
coronavirus disease 2019 
(COVID-19) may help us to 
disrupt it. Yin et al. focused on 
the viral polymerase essential 
for replicating viral RNA. They 
determined a structure of the 
polymerase bound to RNA and to 
the drug remdesivir. Remdesivir 
mimics an RNA nucleotide 
building block and is covalently 
linked to the replicating RNA, 
which blocks further synthesis 
of RNA. The structure provides a 
template for designing improved 
therapeutics against the viral 
polymerase. —VV 

Science, this issue p. 1499 


Sex, age, and the 
macrophage 


Peritoneal macrophages 
are known to contribute 
to pathology of peritonitis, 
endometriosis, and cancer, but 
the effects of age and sex on 
the biology of these cells are 
not well understood. Bain et 
al. show that replenishment of 
peritoneal macrophages from 
the bone marrow is much higher 
in male mice than in female 
mice. This disparity results in 
marked sexual dimorphisms 
in the phenotypic identity 
of peritoneal macrophages, 
including differential expres- 
sion of a receptor involved in 
response to bacterial infection. 
These findings provide insight 
into the effects of sex and age 
on peritoneal inflammation and 
infection. —CNF 

Sci. Immunol. 5, eabc4466 (2020). 
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MOUNTAIN ECOLOGY 
Seeds of change 


Edited by Caroline Ash 
and Jesse Smith 


Climate change is expected 
to alter the ecology of 


high-altitude plants in the 
Andes, like this Andean blueberry 
(Vaccinium floribundum). 


ispersal of seeds is a vital process for static plants. 

High-altitude plant communities are especially 

vulnerable to the effects of changing climate, and it 

is important to understand how climate affects their 

dispersal. Tovar et al. investigated the seed disper- 
sal strategies of plants living close to the summits of the 
tropical Andes. Wind-dispersed species were common 
throughout; plants with unspecialized, gravity-dispersed 
seeds were more frequent at sites with low minimum 
temperatures; and plants with animal-dispersed seeds were 
more frequent at sites with milder climates. As climate 
warms in the future, the spectrum of dispersal strategies 
and the composition of these plant communities may change 
accordingly. —AMS J. Ecol. 10.1111/1365-2745.13416 (2020). 


Pol Il kicks out histone 
variant 


When we think of histone 
proteins, we generally think of 
H2A, H2B, H3, and H4 because 
they are the histones that 
package DNA within our cells. 
However, other histone variants 
perform important regulatory 
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and structural roles. Ranjan et 
al. used single-particle tracking 
of fluorescently tagged proteins 
to examine histone variants in 
yeast cells in vivo and discov- 
ered that RNA polymerase 

II (Pol Il) itself evicts H2A.Z 
from chromatin. In addition, 
the kinase Kin28/Cdk7, which 
phosphorylates serine-5 of 
heptapeptide repeats in the 
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CARBON STORAGE 


Not just for the trees 


he upland bogs, grasslands, and heath of the United Kingdom are iconic 

for their scenic beauty and distinctive natural history. They are also valu- 

able because their vegetation and top 30 centimeters of soil store roughly 

30% of the country’s carbon on 20% of its land area, much more than 

the country’s existing woodlands. Field et a/. estimate that these areas 
currently sequester 8 million metric tons of carbon dioxide equivalents annually 
by photosynthesis. However, many of these areas have been degraded by land 
management practices that favor livestock grazing and game animals, 


such as burning and deep draining, and damage to peat can trig- 
ger carbon emissions. The authors estimate that if restored 


to functioning and diverse ecosystems, the sequestration 
capacity of these special habitats could be increased by 
an extra 6 to 7 gigatons. The main obstacle to this 
is the complexity of land tenure and land-use 
culture in the United Kingdom. —CA 

Biol. Conserv. 248, 108619 (2020). 


carboxy-terminal domain of the 
Pol Il subunit Rpbl, is required 
for this eviction. These findings 
indicate a general mechanism 
coupling eukaryotic transcrip- 
tion to erasure of the H2A.Z 
epigenetic signal. —BAP 

eLife 9,e55667 (2020). 


ULTRAFAST METHODS 
Avoiding multiphoton 


artifacts 


Ultrafast pump-probe crystal- 
lography experiments with 
x-ray free-electron lasers 
have now resolved structural 
changes in several light- 
sensitive proteins. A major 
experimental question has 
been how to set up the pump 
laser to increase the occupancy 
of excited states in the crystal 
but not cause multiphoton 
excitation. Grunbein et al. 
studied how incident laser 
pulses would be refracted and 
reflected by the crystal trans- 
port medium and the crystals 
themselves. They found that 
processes that would attenu- 
ate laser power are minimal, 
whereas refraction increases 


SCIENCE sciencemag.org 


power density in a portion of 
the jet. Power-density titra- 
tions and assessment of the 
spectroscopic properties of 
each experimental setup are 
therefore essential to avoid 
multiphoton artifacts at high 
laser power. —MAF 
Nat. Methods 10.1038/ 
$41592-020-0847-3 (2020). 


ICE SHELVES 
Loss of shelf 


An unusually warm and 
prolonged inflow of warm 
ocean water intruded beneath 
Antarctica’s Filchner Ronne 
Ice Shelf in 2017. Ryan et 

al. describe a 4-year-long 

ime series of ocean tem- 
perature measurements that 
documents the surge and its 
unusual persistence. They 
discuss the hydrographic 
properties of the event and 
possible forcing mechanisms, 
suggesting that it was caused 
by anomalous summer sea ice 
melting. Much of the thinning 
of ice shelves like the Filchner 
Ronne is caused by warm water 
intrusions. Similar intrusions 
beneath other ice shelves will 


+ 


affect the future stability of the 
Antarctic Ice Sheet, a pressing 


concern in our warming climate. 


—HJS 


Geophys. Res. Lett. 
10.1029/2020GL088119 (2020). 


STRUCTURAL BIOLOGY 
The double punch of 
perchlorate 


To make thyroid hormones, 
which are critical in develop- 
ment, iodine ions (I-) must be 
transported into thyroid cells. 
This is achieved by the sodium 
ion (Na*)/I- symporter (NIS), 
which couples the energetically 
unfavorable import of one I> 
with the energetically favorable 
import of two Na*. NIS also 
transports the environmental 
pollutant perchlorate (CIO4-), 
but in this case couples trans- 
port of one ClO4> with that of 
one Na’. Based on theory and 
experiments, Llorente-Esteban 
et al. show that ClO4” binds 
not only to the transport site 
but also to a nontransport 

site. Binding at the second site 
precludes binding of one of the 
Na*. Therefore, ClO4~ not only 
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Carbon dioxide 
equestration capacity of 

UK upland grasslands 

and heaths is compromised 
by land-management 
practices, such 

as heather burning. 


inhibits I> transport by direct 
competition, it also reduces the 
driving force for I- transport 

by binding at the nontransport 
site. —VV 

Nat. Struct. Mol. Biol. 27,533 (2020). 


PHYSICS 
Charmed excitations 


Unlike the general purpose 
detectors used by the ATLAS 
and CMS experiments at the 
Large Hadron Collider, the 
LHCb experiment has equip- 
ment designed specifically to 
study particles that contain the 
so-called beauty and charm 
quarks. By tracking processes 
that result in the formation of 
a charmed lambda baryon and 
a negatively charged kaon, Aaij 
et al. (the LHCb Collaboration) 
have now uncovered evidence 
for at least two previously 
unobserved baryons. These 
baryons have been identified as 
the excited states of a Xi baryon 
consisting of a charm quark and 
two other quarks. The findings 
may help to constrain theories 
on how quarks bind together to 
form baryons. —JS 

Phys. Rev. Lett. 124, 222001 (2020). 
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CORONAVIRUS 
Reducing airborne 
transmission 


Respiratory infectious diseases 
are thought to be transmitted 
mostly through contact with 
surfaces that are contaminated 
with virus-laden droplets pro- 
duced from coughs and sneezes 
of infected individuals, but this is 
not the only transmission route. 
When people speak and breathe, 
they produce tiny droplets 
called aerosols, as well as larger 
droplets, which evaporate and 
become buoyant in airflows. 
In a Perspective, Prather et 
al. discuss the accumulating 
evidence that virus-containing 
aerosols could lead to airborne 
transmission of severe acute 
respiratory syndrome corona- 
virus 2 (SARS-CoV-2), which 
causes coronavirus disease 
2019 (COVID-19). Given the 
lengthy presymptomatic phase 
and asymptomatic infections, 
the authors argue that wearing 
well-fitted masks, especially 
in enclosed indoor spaces, is 
important to help prevent SARS- 
CoV-2 transmission. —GKA 
Science, this issue p. 1422 


GENOMIC DUPLICATIONS 
The fate of genes after 
duplication 


Gene duplication within an 
organism is a relatively com- 
mon event during evolution. 
However, we cannot predict the 
fate of the duplicated genes: Will 
they be lost, evolve, or overlap 
in function within an organismal 
lineage or species? Kuzmin et al. 
explored the fate of duplicated 
gene function within the yeast 
Saccharomyces cerevisiae (see 
the Perspective by Ehrenreich). 
They examined how experi- 
mental deletions of one or two 
duplicated genes (paralogs) 
affected yeast fitness and 

were able to determine which 
genes have likely evolved new 
essential functions and which 
retained functional overlap, a 
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condition the authors refer to 
as entanglement. On the basis 
of these results, they propose 
how entanglement affects the 
evolutionary trajectory of gene 
duplications. —LMZ 

Science, this issue p. 1446; 

see also p. 1424 


DEVELOPMENTAL BIOLOGY 
Guiding regeneration 


Many adult organisms can 
regenerate neural circuits after 
injury. However, it is not clear 
which guidance mechanisms 
operate to promote axon path 
finding in the adult. Scimone et 
al. addressed this question by 
investigating regeneration of 
the planarian visual system (see 
the Perspective by Roberts- 
Galbraith). Distinct muscle cell 
populations were found in close 
association with photorecep- 
tor axons that, together with a 
neuron class, facilitated visual 
system assembly after diverse 
injuries or eye transplantations. 
These cells exhibited features 
similar to embryonic guide- 
post cells and were specified 
independently of eyes in precise 
locations by the action of adult 
positional information cues. 
Absence of these guidepost- 
like cells was associated with 
defective neuronal wiring in 
regeneration. —BAP 

Science, this issue p. 1447; 

see also p. 1428 


NEUROSCIENCE 
The adaptive human 
frontal cortex 


Flexibly switching between 
different tasks is a fundamental 
human cognitive ability that 
allows us to make selective use 
of only the information needed 
for a given decision. Minxha et 
al. used single-neuron record- 
ings from patients to understand 
how the human brain retrieves 
memories on demand when 
needed for making a decision 
and how retrieved memories 
are dynamically routed in the 
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brain from the temporal to the 
frontal lobe. When memory was 
not needed, only medial frontal 
cortex neural activity was cor- 
related with the task. However, 
when outcome choices required 
memory retrieval, frontal cortex 
neurons were phase-locked to 
field potentials recorded in the 
medial temporal lobe. Therefore, 
depending on demands of 
the task, neurons in different 
regions can flexibly engage and 
disengage their activity patterns. 
—PRS 

Science, this issue p. 1448 


STRUCTURAL BIOLOGY 


Engaging the nucleosome 
Cell identity is defined by gene 
expression patterns that are 
established through the binding 
of specific transcription factors. 
However, nucleosomal units 
limit access of transcription 
factors to specific DNA motifs 
within the mammalian genome. 
To study how transcription fac- 
tors bind such chromatinized, 
nucleosome-embedded motifs, 
Michael et al. focused on the 
pluripotency factors OCT4 
and SOX2. They systematically 
quantified the relative affini- 
ties of these factors at different 
motif positions throughout the 
nucleosome, enabling structure 
determination of OCT4-SOX2- 
bound nucleosomes by 
cryo-electron microscopy. OCT4 
and SOX2 bound cooperatively 
to strengthen DNA-binding 
affinity and resulted in DNA 
distortions that destabilized 
the nucleosome. This analysis 
reveals position-dependent bind- 
ing modes that were validated in 
vivo, providing insights on how 
transcription factors read out 
chromatinized motifs. —BAP 
Science, this issue p. 1460 
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STRUCTURAL BIOLOGY 
Ablueprint to 
understand cohesin 


Cohesin is a multiprotein 
complex that entraps sister 
chromatids for chromosome 
segregation and regulates tran- 
scription by extruding DNA loops 
to shape DNA organization. Shi 
et al. determined the structure 
of human cohesin bound to the 
protein NIBPL, which helps load 
cohesin onto DNA, and DNA at 
medium resolution by cryo—elec- 
tron microscopy. Two adenosine 
triphosphatase domains play a 
key role in cohesin function. The 
structure explains how NIBPL 
and DNA synergistically activate 
these domains and gives insight 
into how DNA is trapped by 
cohesin. —VV. 

Science, this issue p. 1454 


SPECTROSCOPY 
DNA circular dichroism 
in gas phase 
Circular dichroism spectroscopy 
is widely used to distinguish 
between nonidentical mirror- 
image molecules. The technique 
relies on differential absorption 
of left versus right circularly 
polarized light and therefore 
tends to require solution-phase 
samples for adequate sensitivity. 
Daly et al. now report gas-phase 
circular dichroism spectra of 
DNA oligonucleotides based 
on detection of photodetached 
electrons rather than transmit- 
ted light (see the Perspective 
by Barran). The salient spectral 
features matched those in solu- 
tion. Pairing the technique with 
mass spectrometry enables 
prior mass selection of particular 
molecules for analysis. —JSY 
Science, this issue p. 1465; 
see also p. 1426 
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EXOPLANETS 
Anearby multiplanet 
system 


Exoplanets can interact gravi- 
tationally with other objects 
orbiting the same star, affect- 
ing their evolution and stability. 
Studying these effects requires 
locating systems with multiple 
planets. Monitoring the nearby 
red dwarf star GJ 887, Jeffers et al. 
detected periodic radial velocity 
signals, indicating the presence 
of two planets on orbits with 
periods of about 9 and 22 days 
and a further candidate planet 
(see the Perspective by Davies). 
The inclinations of the orbits 
are unknown, so only minimum 
masses could be determined, but 
those were consistent with both 
planets being super-Earths— 
more massive than Earth but less 
than Neptune. This system is only 
3.3 parsecs from the Sun, which 
should facilitate follow-up with 
other techniques. —KTS 

Science, this issue p. 1477; 

see also p. 1432 


PHOTOSYNTHESIS 
Pairs of peaks stabilize 
output power 


A counterintuitive feature of pho- 
tosynthesis is that the primary 
pigments involved in absorbing 
light—for example, chlorophyll a 
and b in plants—do not all absorb 
right at the peak of the spectrum 
but instead are offset from the 
peak and each other. Arp et al. 
formulated a network model that 
explains how using pigments with 
this absorption-peak pattern can 
mitigate internal and external 
fluctuations in energy transfer, 
minimizing noise in output power 
(see the Perspective by Duffy). 
The model accurately repro- 
duces absorption peaks for three 
diverse photosynthetic systems 
from different spectral environ- 
ments. Such a mechanism may 
provide an underlying robust- 
ness to biological photosynthetic 
processes that can be further 
tuned and tweaked to adapt to 
longer-scale fluctuations in light 
intensity. -MAF 

Science, this issue p. 1490; 

see also p. 1427 
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DOG GENOMICS 
Sled dog arctic 
adaptations go far back 


Dogs have been used for sled- 
ding in the Arctic as far back as 
~9500 years ago. However, the 
relationships among the earliest 
sled dogs, other dog popula- 
tions, and wolves are unknown. 
Sinding et al. sequenced an 
ancient sled dog, 10 modern sled 
dogs, and an ancient wolf and 
analyzed their genetic relation- 
ships with other modern dogs. 
This analysis indicates that 
sled dogs represent an ancient 
lineage going back at least 9500 
years and that wolves bred with 
the ancestors of sled dogs and 
precontact American dogs. 
However, gene flow between sled 
dogs and wolves likely stopped 
before ~9500 years ago. —LMZ 
Science, this issue p. 1495 


CALCIUM SIGNALING 
Lighting a spark from 
the lysosome 


Calcium ion (Ca?*) sparks in 
vascular smooth muscle cells 
are critical for blood vessel 
dilation and for regulating 
blood pressure. Thakore et al. 
showed that these Ca** sparks 
were initiated by Ca** released 
from lysosomes through the 
cation channel TRPML1 (see 
the Focus by Nieves-Cintron et 
al.). lysosomes in contractile 
vascular smooth muscle cells 
were localized in close proxim- 
ity to this ion channel. Vascular 
smooth muscle cells from mice 
deficient in TRPML1 did not 
generate Ca** sparks, and these 
mice were spontaneously hyper- 
tensive. —WW 

Sci. Signal. 13, eabal015, eabc0993 

(2020). 


CANCER 
Intuitive CD8* T cells 
sense DNA 


Stimulator of interferon genes 
(STING) agonism is an area of 
active exploration for cancer 
immunotherapy. Li et al. exam- 
ined a DNA-sensing cascade 
called cGAS-STING in antitumor 


CD8* T cells. They observed 
dampened STING activity in 
CD8* T cells from patients with 
cancer or mice implanted with 
tumors. By contrast, STING 
signaling in transferred T cells 
supported a stem-like memory 
phenotype, which is known to 

be beneficial for responses to 
immunotherapy. Cytosolic DNA 
was enriched in activated T cells, 
and STING agonism improved 
the efficacy of adoptive cell ther- 
apy in multiple mouse models. 
These results highlight that CD8* 
T cell DNA sensing could be 
exploited for therapeutic benefit 
in immunotherapy. —LP 

Sci. Transl. Med. 12, eaay9013 (2020). 
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GENOMIC DUPLICATIONS 


Exploring whole-genome duplicate gene retention 
with complex genetic interaction analysis 


Elena Kuzmin*, Benjamin VanderSluis*, Alex N. Nguyen Ba, Wen Wang, Elizabeth N. Koch, Matej Usaj, 
Anton Khmelinskii, Mojca Mattiazzi Usaj, Jolanda van Leeuwen, Oren Kraus, Amy Tresenrider, 
Michael Pryszlak, Ming-Che Hu, Brenda Varriano, Michael Costanzo, Michael Knop, Alan Moses, 
Chad L. Myers+}, Brenda J. Andrewst+, Charles Boonet 


INTRODUCTION: Whole-genome duplication (WGD) 
events are pervasive in eukaryotes, shaping 
the genomes of simple single-celled organ- 
isms, such as yeast, as well as those of more 
complex metazoans, including humans. Most 
duplicated genes are eliminated after WGD 
because one copy accumulates deleterious 
mutations, leading to its loss. However, a sig- 
nificant proportion of duplicates persists, and 
factors that result in duplicate gene retention 
are poorly understood but critical for under- 
standing the evolutionary forces that shape 
genomes. 


RATIONALE: Quantifying the functional diver- 
gence of paralog pairs is of particular inter- 
est because of the strong selection against 
functional redundancy. Negative ge- 
netic interactions identify function- 
al relationships between genes and 
provide a means to directly capture 
the functional relationship between 
duplicated genes. Genetic interac- 
tions occur when the phenotype as- 
sociated with a combination of mutations 
in two or more different genes deviates from 
the expected combined effect of the individ- 
ual mutations. A negative genetic interaction 
refers to a combination of mutations that 
generates a stronger fitness defect than ex- 
pected, such as synthetic lethality. Here, we 
used systematic analysis of digenic and tri- 
genic interaction profiles to assess the func- 
tional relationship of retained duplicated 
genes. 


Read the 


RESULTS: To map both digenic and trigenic 
interactions of duplicated genes, we profiled 
query strains carrying single-deletion muta- 
tions and the corresponding double-deletion 
mutations for 240 different dispensable paralog 
pairs originating from the yeast WGD event. In 
total, we tested ~550,000 double and ~260,000 
triple mutants for genetic interactions, and 
identified ~4700 negative digenic interactions 
and ~2500 negative trigenic interactions. We 
quantified the trigenic interaction fraction, 
defined as the ratio of negative trigenic inter- 
actions to the total number of interactions 


Kuzmin et al., Science 368, 1446 (2020) 
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org/10.1126/ 
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associated with the paralog pair. The distri- 
bution of the resulting trigenic interaction 
fractions was distinctly bimodal, with two- 
thirds of paralogs exhibiting a low trigenic 
interaction fraction (diverged paralogs) and 
one-third showing a high trigenic interaction 
fraction (functionally redundant paralogs). 
Paralogs with a high trigenic interaction frac- 
tion showed a relatively low asymmetry in 
their number of digenic interactions, low rates 
of protein sequence divergence, and a negative 
digenic interaction within the gene pair. 

We correlated position-specific evolution- 
ary rate patterns between paralogs to assess 
constraints acting on their evolutionary tra- 
jectories. Paralogs with a high trigenic in- 
teraction fraction showed more correlated 
evolutionary rate patterns and thus 
were more evolutionarily constrained 
than paralogs with a low trigenic 
interaction fraction. Computational 
simulations that modeled dupli- 
cate gene evolution revealed that 
as the extent of the initial entan- 
glement (overlap of functions) of paralogs 
increased, so did the range of functional re- 
dundancy at steady state. Thus, the bimodal 
distribution of the trigenic interaction frac- 
tion may reflect that some paralogs diverged, 
primarily evolving distinct functions without 
redundancy, while others converged to an 
evolutionary steady state with substantial 
redundancy due to their structural and func- 
tional entanglement. 
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CONCLUSION: We propose that the evolution- 
ary fate of a duplicated gene is dictated by an 
interplay of structural and functional entan- 
glement. Paralog pairs with high levels of 
entanglement are more likely to revert to a 
singleton state. In contrast, unconstrained 
paralogs will tend to partition their functions 
and adopt divergent roles. Intermediately en- 
tangled paralog pairs may partition or expand 
nonoverlapping functions while also retain- 
ing some common, overlapping functions, such 
that they can both adopt paralog-specific roles 
and maintain functional redundancy at an 
evolutionary steady state. 
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Complex genetic interaction analysis of duplicated 
genes. The trigenic interaction fraction, which incor- 

porates digenic and trigenic interactions, captures the 
functional relationship of duplicated genes and follows 
a bimodal distribution. Paralogs with a high trigenic 

interaction fraction are under evolutionary constraints 
reflecting their structural and functional entanglement. 
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Exploring whole-genome duplicate gene retention 
with complex genetic interaction analysis 


Elena Kuzmin?2*+, Benjamin VanderSluis**, Alex N. Nguyen Ba*®+, Wen Wang’, Elizabeth N. Koch®, 
Matej Usaj', Anton Khmelinskii°§, Mojca Mattiazzi Usaj', Jolanda van Leeuwen’, Oren Kraus‘, 
Amy Tresenrider’, Michael Pryszlak'?, Ming-Che Hu’, Brenda Varriano’, Michael Costanzo’, 

Michael Knop®®, Alan Moses*®”, Chad L. Myers°#, Brenda J. Andrews'#, Charles Boone’7# 


Whole-genome duplication has played a central role in the genome evolution of many organisms, 
including the human genome. Most duplicated genes are eliminated, and factors that influence the 
retention of persisting duplicates remain poorly understood. We describe a systematic complex genetic 
interaction analysis with yeast paralogs derived from the whole-genome duplication event. Mapping 

of digenic interactions for a deletion mutant of each paralog, and of trigenic interactions for the double 
mutant, provides insight into their roles and a quantitative measure of their functional redundancy. 
Trigenic interaction analysis distinguishes two classes of paralogs: a more functionally divergent subset 
and another that retained more functional overlap. Gene feature analysis and modeling suggest 

that evolutionary trajectories of duplicated genes are dictated by combined functional and structural 


entanglement factors. 


ost eukaryotic genomes, including the 

human genome, contain a substantial 

fraction of duplicated genes (J-7). Gene 

duplication is generated by two main 

mechanisms: segmental duplication 
(small-scale duplication) due to error-prone 
DNA replication, and simultaneous duplication 
of all genomic segments (whole-genome dupli- 
cation) due to a variety of polyploidy events 
(3, 8). Gene duplication provides a source of 
new genes (9), and duplicate retention may 
lead to the development of specialized func- 
tional modules involving paralogous pro- 
teins through “subfunctionalization,” which 
promotes biological complexity (10). Nonethe- 
less, after duplication most paralogs are elimi- 
nated from the genome because the functional 
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redundancy of duplicated genes is evolution- 
arily unstable, as one gene copy may accumu- 
late intrinsically deleterious mutations and be 
removed from the genome by selection (17). 
However, a significant fraction of duplicates 
has been retained over the course of evolution. 
Thus, understanding the molecular mecha- 
nisms that underlie duplicate gene retention 
may provide insights into the evolutionary 
forces that shape genomes. 

About 100 million years ago, the budding 
yeast Saccharomyces cerevisiae arose from a 
whole-genome duplication (WGD) event, and, 
after a massive gene loss, retained 551 dupli- 
cate gene pairs (6, 7, 12). Quantifying the func- 
tional divergence of each paralog pair is of 
particular interest because of the strong selec- 
tion against functional redundancy. Paralog 
functional divergence has been estimated by 
the rates of evolutionary divergence of coding 
and regulatory regions (6, 12-74), Gene Ontol- 
ogy (GO) semantic distance (15-17), metabolic 
flux analysis (18, 19), similarity of gene expres- 
sion profiles (3, 10, 20-23), changes in the en- 
coded protein abundance of one sister upon 
perturbation of another (24), and analysis of 
similarity of partners within the protein-protein 
interaction network (25). 

Genetic interaction analysis provides a power- 
ful means to directly capture the functional 
relationship between duplicated genes. Ge- 
netic interactions occur when a combination 
of mutations in different genes results in 
an unexpected phenotype, deviating from a 
model based on the integration of the individ- 
ual mutant phenotypes. A negative genetic 
interaction occurs when a combination of mu- 
tations leads to a more severe fitness defect 
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than expected (26). An extreme example of a 
negative digenic interaction is synthetic le- 
thality, which occurs when two mutations, 
neither of which is lethal on its own, combine 
to generate an inviable double-mutant pheno- 
type (27, 28). Negative genetic interactions 
often occur between genes that impinge on a 
common essential function. A positive genetic 
interaction occurs when a combination of mu- 
tations results in a phenotype that is less severe 
than expected from the phenotypes asso- 
ciated with the single mutants. For exam- 
ple, digenic suppression is observed when a 
double mutant exhibits a greater fitness than 
the sickest single mutant (29). A previous global 
digenic interaction network in yeast identi- 
fied ~550,000 negative and ~350,000 posi- 
tive genetic interactions (30). The profile of 
genetic interactions for a specific query gene 
provides a quantitative measure of function 
(31, 32), and a network based on genetic in- 
teraction profile similarity reveals a hierar- 
chy of functional modules, including pathways 
and complexes, bioprocesses, and cellular com- 
partments (30). 

A systematic analysis of the digenic inter- 
actions between duplicated gene pairs in yeast 
revealed functional redundancy, whereby ~30% 
of pairs interacted (relative to ~3.6% for random 
gene pairs) (33-37). However, this observation 
may indicate that the overall contribution of 
duplicate retention to the ability of an organism 
to tolerate mutations, known as mutational 
robustness, is relatively low because total func- 
tional compensation is only observed for a 
minor fraction of duplicates (38). Mechanisms 
that drive duplicate retention may be influ- 
enced by gene dosage effects or by functional 
divergence through asymmetric evolution (33). 
Retention of duplicates may result from sub- 
functionalization, such that duplicates degen- 
erate in some of their function differentially 
and result in a pair of genes that function fully 
as the single ancestral copy; this outcome is 
postulated by the duplication-degeneration- 
complementation model (39). Evidence of 
functional partitioning of ancestral functions 
includes but is not limited to biochemical 
function (40), gene expression regulatory ele- 
ments (22), and subcellular localization pat- 
terns (41). 

Here, we expanded upon the use of genetic 
interaction profiles to capture the functional 
relationship of duplicated genes. We compared 
trigenic interactions for double-mutant query 
strains deleted for both members of nones- 
sential paralog pairs to the corresponding 
digenic interaction profiles for each single- 
mutant sister gene, and we quantified a spec- 
trum of functional redundancy among paralogs. 
A correlative analysis of the gene features 
suggests that the evolutionary trajectories 
of retained duplicated genes can be driven by 
genes encoding functionally and structurally 
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constrained proteins, which we refer to as 
“entangled.” 


Mapping trigenic interactions 
for duplicated genes 


We constructed 240 double-mutant query yeast 
strains, each deleted for a pair of nonessential 
WGD paralog genes (tables S1 to S4). These 
are dispensable paralog pairs and represent 
44% (240/551) of unique WGD paralog pairs 
(72), a number of which were not included 
because the pair was either essential or re- 
fractory to double-mutant query strain con- 
struction (42). Using colony size as a proxy for 
cell fitness, we measured the growth pheno- 
types of the set of 240 double-mutant query 
strains and the corresponding 480 single mu- 
tants (table S5) (42), which correlated well 
with previous large-scale measurements of 
single-mutant fitness (Pearson correlation co- 
efficient 7 = 0.51, P = 3 x 10°°°) and double- 
mutant fitness (r = 0.72, P = 2 x 10°) (30) 
(fig. S1, A to D). 

We used the set of single- and double-mutant 
query strains to score digenic and trigenic in- 
teractions, respectively, using trigenic synthetic 
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genetic array analysis (t-SGA) (Fig. 1). We crossed 
the queries to a diagnostic array of nonessential 
gene deletion mutants and essential gene mu- 
tants, carrying temperature-sensitive alleles, 
which span all major cellular processes and 
cover ~1200 genes representing ~20% of the 
yeast genome (37). In total, we examined 
537,911 double and 256,861 triple mutants for 
genetic interactions. Query strains deleted 
for an individual paralog gene were screened 
for digenic interactions, and double-mutant 
query strains deleted for both paralogs were 
screened for trigenic interactions in two rep- 
licate screens with four colonies per screen 
(fig. S2, A to C) (37). Negative and positive in- 
teractions were quantified for digenic and 
trigenic interactions (30, 32), which were de- 
termined from validation of trigenic interac- 
tions of the CLNI-CLN2 double-mutant query, 
as previously described (37). This resulted in 
an estimated recall (sensitivity) of ~60% and a 
precision of ~75% (37). Additionally, we used 
replicate screens to independently estimate 
the false discovery rate (FDR) as a function of 
recall, which resulted in a consistent estimate 
of >75% precision (<25% FDR) (fig. S2D). 
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Fig. 1. Trigenic synthetic genetic array (t-SGA) analysis for paralogs. (A) An illustration of the 

t-SGA experimental approach in which a query set of 240 dispensable paralog pairs originating from 
whole-genome duplication (WGD) in yeast was screened for trigenic interactions. Three types of screens 
were carried out in parallel, whereby triple-mutant fitness was estimated by crossing a double-mutant 
query strain deleted for both paralogs (light and dark blue solid circles) into a diagnostic array of single 
mutants (black solid circles) (37). After induction of meiosis in heterozygous triple mutants, sequential 
replica-pinning steps are used to select haploid triple-mutant progeny. Single-mutant control query strains 
are screened in parallel to estimate paralog-specific double-mutant fitness. (B) We used the t-SGA scoring 
method to identify trigenic interactions quantitatively by combining double- and triple-mutant fitness 
estimates derived from colony size measurements (37). The digenic interaction score is expressed as ej, 
where fj; is the observed double-mutant fitness, and the expected double-mutant fitness is expressed 

as the product of single-mutant fitness estimates fif. The trigenic interaction score is expressed as tx, where 
fix is the observed triple-mutant fitness and fff, is the triple-mutant fitness expectation expressed as the 
product of three single-mutant fitness estimates. 
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This analysis identified 4650 negative and 
2547 positive digenic interactions, as well as 
2466 negative and 2091 positive trigenic in- 
teractions (tables S1, S2, and S4). About one- 
third of negative and one-fourth of positive 
trigenic interactions were of the “novel” class, 
identifying connections that were not observed 
in their corresponding digenic interaction net- 
work (Fig. 2). Indeed, for 129 paralog pairs, 
the corresponding single genes displayed only 
sparse digenic interaction profiles on the global 
genetic interaction network (30) (table S6). 
However, they exhibited more novel trigenic 
interactions (~40% negative and ~31% positive 
trigenic interactions) than average, indicating 
that paralog pair trigenic interactions expanded 
the known global genetic interaction network. 
The remaining two-thirds of negative and three- 
fourths of positive trigenic interactions over- 
lapped a previously identified digenic interaction 
and thus represent a “modified” class of tri- 
genic interactions (Fig. 2). Paralogs with more 
negative trigenic interactions than digenic in- 
teractions showed predominantly novel and 
modified negative trigenic interactions, which 
overlapped exclusively with negative digenic 
interactions indicating functional redundancy 
(fig. S3A). 


Genetic interaction profiles highlight 
functional divergence of duplicated genes 


The functional relationship between paralogs 
should be captured by their negative genetic 
interactions. Consistent with previous obser- 
vations (32, 33), we observed that duplicates 
showed fewer negative digenic interactions 
than singletons (Wilcoxon rank-sum test, P = 
6 x 107°; fig. S3B), which is suggestive of a 
general trend in which paralogs retain func- 
tional redundancy. We predicted that highly 
divergent gene pairs should exhibit a rela- 
tively high number of paralog-specific nega- 
tive digenic interactions (Fig. 3). In contrast, 
functionally overlapping paralogs should be 
biased toward trigenic interactions and should 
display few paralog-specific interactions (Fig. 
3). To assess this possibility, we computed the 
trigenic interaction fraction, defined as the ratio 
of negative trigenic interactions to the total 
number of all negative interactions (digenic 
and trigenic) associated with the paralog pair 
(Fig. 3). 

From this analysis, we observed that double 
mutants involving duplicates showed a range 
of trigenic and digenic interaction degrees. 
This indicated that complex genetic inter- 
actions may reveal their functional redundancy 
(fig. S3B). For example, SKT7-HBS1 is a paralog 
pair that showed a relatively low trigenic in- 
teraction fraction (Fig. 3). Although the SK77 
and HBSI gene products both recognize stalled 
ribosomes and initiate mRNA degradation, 
they do so in a different manner (43). Ribosomes 
that stall upon encountering an in-frame stop 
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Fig. 2. Distribution 
of different types of 
trigenic interactions 
for paralogs. The dif- 
ferent types of trigenic 
interactions for all 
paralogs are compared 
in a pie chart. Nega- 
tive [(t or €) < -0.08, 
P < 0,05] and positive 
(t or €) > 0.08, P< 
0.05] genetic interac- 
tions are shown in 
blue and yellow, 
espectively. A trigenic 


pos. digenic 
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interaction between a double-mutant query and the array strain is called “novel” (dark blue/dark yellow) if there 
is no significant digenic interaction between either single-mutant control query and the array strain or 
between the query gene pair. Trigenic interactions that overlap with one or more negative or positive digenic 
interactions are called “modified” and are further classified by the type of digenic interaction. All trigenic 
interactions of double-mutant query strains that show a negative or a positive digenic interaction between the 
members of a query gene pair (P;-P2) (|e| > 0.08, P < 0.05) are considered “modified.” Interactions may 
be further classified by digenic interactions (if any) between a single-mutant query control strain and the 
array strain (P; and/or P2-A negative, P; and/or P2-A positive). Digenic interactions of the same sign are in 
medium blue/yellow, digenic interactions of the opposite sign are in light blue/yellow, and mixed positive 


and negative digenic interactions are depicted in different shades of gray according to whether their modified 


trigenic interactions are positive or negative. 


codon at the 3’ end of a transcript are recognized 
by Ski7, which in turn recruits the RNA exosome 
(44). In contrast, ribosomes stalled within the 
coding region of a transcript, possibly because 
of an unusual structural conformation or dam- 
age in the mRNA, are recognized by Hbs1, 
which initiates mRNA cleavage in an RNA 
exosome-independent manner (45). Our model 
(Fig. 3) predicts that the digenic interaction 
profiles of the paralogs should reflect their 
independent function. Indeed, SKI7 showed 
digenic interactions with genes involved in 
mRNA 3’ end protection and 5’-3’ mRNA de- 
cay, such as PAT] and LSM1, whereas HBS1 
interacted with numerous genes involved in 
ribosome biogenesis and recycling. Thus, the 
low trigenic interaction fraction of the SKI7- 
HBS1 gene pair (Fig. 3) appears to reflect 
the functional divergence of these paralogs 
(39, 40, 43). 

Conversely, the MRS3-MRS4 duplicate pair 
showed a high trigenic interaction fraction 
(~0.85) (Fig. 3). These paralogs are members 
of the eukaryotic-specific mitochondrial car- 
rier family, which transports compounds, in- 
cluding nucleotides, amino acids, carboxylates, 
small inorganic ions, and vitamins, across the 
inner mitochondrial membrane linking the 
cytosolic and mitochondrial biochemical path- 
ways (46). MRS3 and MRS4 encode highly 
similar mitochondrial carrier proteins with 
high affinity for Fe?*, which they transport 
across the inner mitochondrial membrane 
(47). The corresponding vertebrate homolog, 
mitoferrin, is involved in erythropoiesis by 
maintaining mitochondrial iron homeosta- 
sis (48). The MRS3-MRS4 trigenic interactions 
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involved genes related to cell redox homeo- 
stasis, such as GRX3, TSAI, and TRX3. The 
processes that regulate Fe?* homeostasis are 
important components of the cellular defense 
mechanism against oxidative damage. Indeed, 
the MRS3-MRS4 trigenic interactions were 
also enriched for genes involved in DNA repli- 
cation and repair, including genes encoding 
members of the Rad51-Rad57 complex (RAD5I, 
RAD54, RAD55, RAD57), the Rad5-Rad6-Rad18 
complex (RAD5), the DNA replication factor 
C complex (CTF8, CTFI8), the MRX complex 
(MREII, XRS2), the Holliday junction resolvase 
complex (MUS81, MMS4), and the nucleotide- 
excision repair factor 3 complex (7FBI, SSL1) 
(49). Together these examples illustrate how 
the trigenic interaction fraction may reflect 
the degree of functional overlap of paralogs. 


Distribution of trigenic interaction fraction 
among retained paralog pairs 


In total, we measured the trigenic interaction 
fraction for 161 paralog pairs that showed at 
least six total trigenic or digenic interactions 
(table S7). These paralog pairs displayed a range 
of trigenic interaction fraction values (Fig. 4A) 
that showed a distinctly bimodal distribution, 
with 114 paralog pairs exhibiting a relatively low 
trigenic interaction fraction (below 0.4) while 
a smaller subset of 47 paralogs displayed a 
higher trigenic interaction fraction (above 
0.4). This finding suggests that comparison of 
digenic and trigenic interaction profiles is an 
effective way to differentiate paralog pairs and 
can provide insight into the extent of func- 
tional overlap between members of a given 
duplicated gene pair (Fig. 4A and table S7). 
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This distribution of trigenic interaction did 
not differ for subsets of duplicated genes that 
originated by distinct mechanisms, such as 
ohnologs, which originated from WGD, or 
homeologs, which originated from hybridiza- 
tion between species (42, 50). We confirmed 
that the genetic interaction profiles we gen- 
erated are robust to array size, in that we ob- 
served a significant correlation for the trigenic 
interactions obtained from the diagnostic ar- 
ray and the genome-wide array, which were 
derived from screening 11 double mutants 
with their single-mutant control query strains 
in two replicates each (fig. S4:). The strength 
of correlation between replicates was not af- 
fected appreciably with decreasing stringency 
for either digenic or trigenic interactions (fig. 
$2, A to C), indicating that our conclusions are 
not dependent on an interaction score thresh- 
old. The remaining 79 paralog pairs, repre- 
senting about one-third of all screened pairs, 
were characterized by sparse genetic interac- 
tion profiles. These paralogs span a diverse set 
of biological processes and tend to belong to 
larger gene families (one-sided ¢ test, P = 0.04; 
table S9), which may confer higher-order re- 
dundancy and reduce second- and third-order 
genetic interactions. 


Trigenic interaction fraction, digenic 
interaction profiles, and paralog properties 


The trigenic interaction fraction of paralog 
pairs was also associated with several funda- 
mental physiological and evolutionary prop- 
erties (Fig. 4B and tables S5, $7, and S10). 
Paralogs with a high trigenic interaction frac- 
tion tend to exhibit low asymmetry with re- 
spect to their number of digenic interactions. 
The asymmetry score measures whether the 
digenic profile of one paralog is composed of 
substantially more interactions than its cor- 
responding duplicate, and it correlates with 
divergent evolution of paralog gene sequences 
(33). Consistent with this observation, a high 
trigenic interaction fraction also correlated 
with relatively low rates of protein sequence 
divergence (Fig. 4B). We also observed a high 
trigenic interaction fraction in paralogs whose 
double mutant showed a negative digenic in- 
teraction, which is often associated with func- 
tionally related genes (Fig. 4B) (30, 37). 
Conversely, paralogs with a low trigenic in- 
teraction fraction often showed a high asym- 
metric score for their digenic degrees (Fig. 4B). 
Consistent with our hypothesis that a low tri- 
genic fraction is indicative of functional di- 
vergence (Figs. 3 and 4A), a high asymmetry 
score may reflect that one paralog has evolved 
a specialized role. For example, a paralog dis- 
playing relatively few digenic interactions un- 
der standard conditions may only be expressed 
and functional under a different environmen- 
tal condition or during a specialized devel- 
opmental program. Indeed, in the case of four 
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Fig. 3. Mapping the func- 
tional relationship of para- 
logs through their digenic 
and trigenic interactions. 
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behave in a manner con- 
sistent with the expectation 
and display fewer trigenic 
than digenic interactions. 
However, functionally 
redundant paralogs such as 
MRS3-MRS4 display a higher 
fraction of trigenic interac- 
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tions with a corresponding drop in the fraction of paralog-specific digenic interactions. The fraction of 
different types of genetic interactions is illustrated using bar graphs. The fraction of total genetic interactions 
attributed to the trigenic interactions associated with a parlA par2A double-mutant query, deleted for 

both paralogs, is shown as a dark blue bar; the fraction of digenic interactions associated with each paralog 
single-deletion mutant, parlA or par2A, is shown as a light blue bar. 


asymmetric paralog pairs, expression of the 
low-degree sister was induced during sporu- 
lation, a meiotic developmental program that 
cells enter in response to a low-carbon and low- 
nitrogen environment. Specifically, GZS7 in 
paralog pair GISI-RPH1, HES] in HESI-KES1, 
ECM in ECI-DCH, and DON] in DONI-CUE5 
showed higher meiotic gene expression (table 
S11) (67). For another seven asymmetric paralog 
pairs, the low-degree paralog was required 
for filamentous growth (52), and for another 
15 asymmetric pairs, the gene expression of 
the low-degree sister was induced under glu- 
cose starvation (table S11) (53). In total, 22 of 
63 asymmetric pairs (~35%) have a paralog 
that may have been retained for function in 
a different condition or during a specialized 
program (table S11). 

The retention of gene duplicates may also be 
related to gene dosage (54). For example, some 
duplicates appear to be maintained as a result 
of selection for high levels of expression, as in 
the case of metabolic genes that provide high 
enzymatic flux (8). WGD events also enable 
maintenance of stoichiometric balance of pro- 
tein complex members, consistent with the 
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high rate of duplication among components 
of the ribosome (35). Dosage duplicates are 
associated with a severe fitness defect when 
either one of the paralogs is deleted, as well as 
a greater digenic interaction profile similar- 
ity than other duplicates, indicating that they 
have retained functional redundancy (33, 38). 
Because dosage duplicates should have sub- 
stantial functional redundancy, we reasoned 
that they might also tend to have a higher 
trigenic interaction fraction. Although the 
paralogs with a greater digenic interaction 
profile similarity also tend to have a higher 
trigenic interaction fraction (Fig. 4C), poten- 
tial dosage duplicates (33) were found within 
both the high and low trigenic interaction frac- 
tion distributions (table S10). Thus, the func- 
tional redundancy associated with the subset 
of paralogs with a higher trigenic interaction 
fraction is probably not driven solely by dosage 
mechanisms. 

The evolution of regions important for pro- 
tein localization may lead to differential sub- 
cellular localization of some paralogous proteins 
and may play a role in their retention by en- 
abling specific paralog functions in distinct 
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parts of the cell (47, 55). However, because on 
average duplicated genes do not appear to 
evolve a relocalization more frequently than 
singletons, this may not represent a major 
mechanism driving paralog retention (56). We 
observed that paralogs with different subcel- 
lular localization patterns tended to show a 
modestly higher trigenic interaction frac- 
tion than those with the same subcellular lo- 
calization patterns (Wilcoxon rank-sum test, 
P < 0.05), which suggests that differentially 
localized paralogs may retain some functional 
overlap. 


Defining paralog function in terms 
of trigenic interactions 


To characterize the roles of paralog pairs with 
overlapping functions, we mapped their tri- 
genic interactions onto the global digenic 
interaction profile similarity network (30) 
(Fig. 5A). Using this approach, trigenic inter- 
actions associated with paralog pairs that have 
a relatively high trigenic interaction fraction 
can be examined for enrichment within defined 
bioprocesses. For example, the sbe2A sbe22A 
double mutant showed few paralog-specific 
interactions and numerous trigenic interac- 
tions, with a trigenic interaction fraction of 
~0.69 (table S7). The proteins encoded by the 
SBE2-SBE22 pair share ~51% amino acid se- 
quence identity and have relatively well- 
characterized functions in the transport of 
cell wall components from the Golgi to the 
cell surface (57). The SBE2-SBE22 negative tri- 
genic interactions involved genes enriched for 
GO annotations related to vesicle-mediated 
transport and cell wall organization (Fig. 5B) 
and include interactions with an ARF-like small 
guanosine triphosphatase (GTPase) secretion 
pathway (ARLI, ARL3, SYSI, YPT6), the exocyst 
(SEC4), and the chitin biosynthesis pathway 
(CHS5, CHS6). 

Previously, we showed that trigenic inter- 
actions, like digenic interactions, are func- 
tionally informative and are enriched among 
genes annotated to the same biological pro- 
cess (37). Mapping paralog trigenic interaction 
profiles onto the global digenic interaction 
profile similarity network enables the func- 
tional annotation of previously uncharacter- 
ized paralogs. For example, ECM13-YJRLU5W 
showed a trigenic interaction fraction of 0.77 
with 12 negative digenic and 40 negative tri- 
genic interactions, suggesting high functional 
redundancy (fig. SSA and table S7). These 
paralogs are fungal-specific and share 39% 
identity and 70% similarity at the amino acid 
sequence level. The genes that constitute the 
ECM13-YJR1I5W trigenic interaction profile 
were positioned mainly within the mitosis 
and DNA replication and repair clusters on 
the global digenic profile network (Fig. 5C). 
The ECM13-YJR115 trigenic interaction profile 
is correlated to the digenic interaction profiles 
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Fig. 4. The trigenic interaction fraction correlates with fundamental physiological and evolutionary 
properties. (A) Negative trigenic interaction fraction distribution of screened paralogs, (t or e) < -0.08, 
P < 0.05; paralogs with at least six trigenic or digenic interactions in one of the screens are considered. 


Representative examples of paralogs with a low (SKI7-HBS1) and high (MRS3-MRS4) trigenic interaction 
fraction are marked with an arrow. (B) Physiological and evolutionary properties for paralogs characterized 
by varying fraction of trigenic interactions. Spearman correlation coefficients (r) and associated P values 
are measures of the strength of the correlation between the trigenic interaction fraction and the three 
features being examined: digenic interaction degree asymmetry, sequence divergence rate, and paralog pair 
interaction strength. The correlation was measured on the entire dataset. The bar plots serve to visualize 
the trend, in which a trigenic interaction fraction cutoff of 0.4 was used to identify paralogs with low and 
high trigenic interaction fraction, based on negative interactions (t or ¢) < —0.08, P < 0.05. Means of 
specified features are shown; error bars denote SEM. (C) The distribution of global digenic profile correlation 
similarity (30) was compared for paralogs with high and low trigenic interaction fraction. A trigenic 
interaction fraction cutoff of 0.4 was used as in (B). Box plots serve to summarize the data of the 


distributions. Analyses are restricted to paralogs with at least six total trigenic or digenic interactions in one 
of the screens. Significance was assessed using one-tailed Wilcoxon rank-sum test. 


of CCT4 and CCT5, which encode members of 
the cytosolic chaperonin Cct ring complex that 
participates in the assembly of tubulin, and 
RBL2, which is involved in microtubule mor- 
phogenesis, suggesting a possible microtubule- 
related role for this paralog pair (30) (fig. S5B). 
The ecm13A yjrl1I5wA double mutant was spe- 
cifically sensitive to benomyl, a microtubule- 
disrupting agent (fig. S5, C and D), and showed a 
delay in spindle nucleation and polymeriza- 
tion (fig. S5E). Moreover, the Ecm13 protein 
interacts with Dad2 and Dad4, which are 
components of the Dam1 complex that links 
kinetochores to microtubules to facilitate chro- 
mosome segregation (58). Consistent with a 


role in spindle function and chromosome seg- 
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regation, we found that both GFP-Ecm13 and 
GFP-Yjrll5w showed a distinct nuclear local- 
ization (fig. SSF). Another poorly characterized 
pair is STB6-STB2, which encodes proteins 
that bind the S/N3 transcriptional repressor 
and may impinge on the MTC (maintenance 
of telomere capping) pathway, suggesting a 
role in aromatic amino acid permease secretion 
(30) (fig. S5, G to J). 


Correlation analysis of position-specific 
evolutionary rate patterns between paralogs 


To further explore the factors that drive paralog 
retention and produce a spectrum of functional 
redundancy, we considered the relationship 
among divergent evolution, structure, and 
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function. A negative correlation of trigenic 
interaction fraction with measures of diver- 
gent evolution may be indicative of the con- 
straints that govern this process (Fig. 4B). 
Amino acid conservation in specific positions 
indicates evolutionary constraints on amino 
acid residues that are important for protein 
function, including those involved in oligo- 
merization, protein-protein interactions, and 
protein-substrate interactions (59). We there- 
fore used the correlation of position-specific 
evolutionary rate patterns between paralogs 
as a measure of evolutionary divergence of the 
sequence constraints (42). We reasoned that if 
both paralogs share constraints on specific 
residues of protein domains during evolution 
and evolve similarly after the WGD, then they 
would be expected to be more similar to each 
other than to the pre-WGD species. On the other 
hand, if paralogs do not share constraints on 
specific residues of protein domains during 
evolution and evolve differently after the WGD, 
then they would be less similar to each other 
than to the pre-WGD species. We therefore 
designated a set of pairs with a high correlation 
of position-specific evolutionary rate patterns 
as evolutionarily constrained pairs for which 
the correlation of rates between extant sisters 
was greater than or equal to both the corre- 
lations between each sister and the pre-WGD 
homolog. Paralog pairs with little correlation 
in their position-specific evolutionary rate pat- 
terns have different sequence constraints and 
are therefore likely to have different structure 
and function (Fig. 6A and table S12). 

For example, the proteins encoded by the 
MRS3-MRS4 gene pair show a high correlation 
in their position-specific evolutionary rate pat- 
terns and also have high protein sequence iden- 
tity (~76%) (Fig. 6B and fig. S6A). Although both 
MRS3 and MRS4 encode high-affinity mito- 
chondrial Fe** transporters, the MRS3 protein 
displays an additional Cu”* transport function 
(60). The general mitochondrial carrier function 
is structurally constrained because all mitochon- 
drial carrier proteins have a tripartite struc- 
ture with three similar segments, each of which 
is ~100 amino acids in length and forms two 
membrane-spanning «o helices (fig. S7A) (67). 
Interestingly, these proteins appear to have 
properties that are favorable for retention after 
duplication because multiple mitochondrial 
carrier protein genes were retained after the 
yeast WGD. Indeed, of the 35 different mito- 
chondrial carrier protein genes in the yeast 
genome, 10 are encoded by five WGD paralog 
pairs (62). 

Our study interrogated four mitochondrial 
carrier protein WGD pairs: MRS3-MRS4, YIA6- 
YEA6, YMCI-YMC2, and ODCI-ODC2. Like the 
MRS3-MRS4 paralog pair, YIA6 and YEAG are 
connected by a negative digenic interaction 
and show a high trigenic interaction fraction, 
which is consistent with a retained functional 
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Fig. 5. The trigenic interaction fraction reveals the functional divergence 
of duplicated genes and illuminates gene function. (A to C) SAFE (70) 
analysis (A) was used to visualize regions of the global digenic interaction 
profile similarity network (30) that were enriched for genes in the trigenic 
interaction profiles of the paralog pairs SBE2-SBE22 (B) and ECM13-YJRII5W 
(C). Blue indicates the enrichment related to negative trigenic interactions 


(t < -0.08, P< 0.05). 


redundancy. In contrast, both the YMCI-YMC2 
and ODCI-ODC2 paralog pairs display rela- 
tively few trigenic interactions. Indeed, the 
YMCI-YMC2 pair displays a low correlation of 
position-specific evolutionary rate patterns and 
thus a low level of functional redundancy (table 
$12). On the other hand, ODCI-ODC2 displays 
a relatively high correlation of position-specific 
evolutionary rate patterns (table S12); how- 
ever, their functional overlap may be masked 
by the presence of other mitochondrial carrier 
proteins because they belong to a large gene 
family with multiple paralog members, ex- 
panded also by small-scale duplications (table 
S9). Moreover, we combined our genetic analysis 
with literature curated data to map a genetic 
network underlying numerous mitochondrial 
carrier protein genes, and the YMWCI-YMC2 and 
ODCI-ODC2 paralogs appear to form a highly 
connected subnetwork (fig. S8), which suggests 
that these two paralog pairs display a more 
complex functional redundancy. 

In contrast to MRS3-MRS4, the SKI7-HBS1 
proteins show a relatively low correlation in 
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their position-specific evolutionary rate pat- 
terns and thus show a low sequence identity 
(~26%) with an asymmetric rate of sequence 
divergence, whereby Ski7 appears to be di- 
verging faster than Hbs! (table S10). Detailed 
inspection of these proteins revealed that the 
Hbs1 protein resembles the pre-WGD homo- 
log, whereas Ski7 has adopted a more diver- 
gent fate, which suggests that its evolved role 
was not constrained structurally by the pre- 
WGD ancestor (Fig. 6B and fig. S6B). Despite 
retaining the EF-Tu GTP-binding domain 
(PF00009), it is present in a highly divergent 
form. Ski7 also lost critical sequences encoding 
an Hbs!I-like N-terminal motif (PF8938) and 
sequences encoding the EF-Tu C-terminal do- 
main (PF03143), highlighting the evolutionary 
divergence of Ski7 from Hsb1 and the pre-WGD 
homolog (Fig. 6B and fig. S6B). 

By calculating the level of correlation of 
position-specific evolutionary rate patterns 
between members of the duplicate pair in rela- 
tion to the pre-WGD homolog (Fig. 6A), we 
assessed the evolutionary constraints acting 
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on paralogs (Fig. 6C). We found that paralogs 
with a high trigenic interaction fraction were 
composed of a significantly higher number 
of paralogs with correlated evolutionary rate 
patterns and thus were more evolutionarily 
constrained than those characterized by a 
low trigenic interaction fraction (Fisher exact 
test, P = 0.01) (Fig. 6C). Moreover, paralogs 
that show a synthetic lethal genetic interac- 
tion are considered highly functionally redun- 
dant. This subset of essential paralogs shows 
a higher correlation in their position-specific 
evolutionary rate patterns, which suggests that 
these genes are more evolutionarily constrained 
than those within the subset of highly redun- 
dant nonessential paralogs displaying a high 
trigenic interaction fraction (Fisher exact test, 
P = 0.02) (Fig. 6C and table S13). Hence, it 
appears that some paralogs are highly struc- 
turally constrained or “entangled,” which limits 
their divergence leading to the maintenance 
of functional overlap, but presumably within 
a context that also enables the evolution of 
novel functions. 
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Modeling simulates the divergent evolution 

of paralogs with retained functional redundancy 
We also explored functional redundancy and 
paralog retention using in silico modeling in 
an attempt to test two hypotheses. Under 
the first hypothesis, the retained nonessential 
paralog pairs with a high trigenic interac- 
tion fraction—and thus a high functional 
overlap—are inherently unstable over evolu- 
tionary time and would eventually diverge 
completely, losing any common functional- 
ity. Under the alternative hypothesis, retained 
paralogs may converge to an evolutionary 


functional overlap cannot segregate certain 
functional regions without a fitness cost. We 
computationally generated “genes” of fixed 
length, in which regions of random length 
were assigned responsibility for a function, 
and a random number of such functions was 
generated for each gene, such that these func- 
tional regions were allowed to overlap. Then 
we duplicated each gene and began introduc- 
ing random “degenerative” mutations, which 
would render the affected paralog unable to 
perform any function associated with the 
mutated region (Fig. 7A). We discarded any 


functions could not be carried out by at least 
one sister, and continued simulating muta- 
tions until the pair reached steady state and 
could tolerate no additional mutations. The 
extent of overlap of functions in each ran- 
domly generated ancestral gene at the start of 
paralog evolution provides a measure of their 
initial structural and functional “entangle- 
ment,” generating a baseline from which we 
assessed their evolutionary trajectories (42). 

These simulations revealed that for a large 
fraction of paralog pairs, the mutation process 
resulted in a singleton state with only one of 


steady state, in which paralogs with retained 


lineage as unfit when any one of the original 


the sisters being retained. A sizable fraction of 
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Fig. 6. Evolution of retained overlap due to evolutionary constraints acting 
on duplicated gene sequences. (A) Schematic depiction of the analysis of 
correlated evolutionary sequence changes across paralog sequences reflecting 
evolutionary constraints on paralogs. Correlated rates of evolution for 

specific columns in multiple sequence alignments for the pre-WGD homolog 
and each paralog are denoted with a gray-to-black gradient, from low to high, 
respectively. High correlation of position-specific evolutionary rate patterns 
identifies residues with similar evolutionary constraints. Paralogs with cor- 
related rates (rpart:par2) that are greater than or equal to that of each paralog 
and with the corresponding pre-WGD (part:prewap aNd Mpar2:prewan) were 
designated as having a high correlation of position-specific evolutionary rate 
pattern, and paralogs with correlated rates (Fpart:par2) that were less than 
that of either paralog or both paralogs with the pre-WGD (rpar:prewap and/or 
lpar2:prewap) Were designated as having a low correlation of position-specific 
evolutionary rate pattern; r refers to the Pearson correlation coefficient be- 
tween the respective sequences. (B) Examples of evolutionary rates for posi- 
tions in the alignments for representative paralogs, showing a high correlation 
of position-specific evolutionary rate patterns (MRS3-MRS4) or a low correlation 
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of position-specific evolutionary rate patterns (SKI7-HBS1). The position in the 
alignment is plotted on the x axis; the rate of evolution at a particular position 
divided by the average rate of evolution for all residues in the given sister paralog 
is plotted on the y axis. The scale of the y axis has been fixed for each paralog 
pair. Pfam domains are annotated. The MRS3-MRS4 alignment shows three 
mitochondrial carrier repeats, each composed of two a helices (blue, 

Hl and H2; red, H3 and H4; yellow, H5 and H6) followed by a characteristic 
motif Pro-X-[Asp/Glu]-X-X-[Lys/Arg]-X-[Lys/Arg]-(20 to 30 residues)- 
Asp/Glu]-Gly-X-X-X-X-[Trp/Tyr/Phe]-[Lys/Arg]-Gly connecting each pair of 
membrane-spanning domains by a loop. The SKI7-HBS1 alignment shows GTP 
EF-Tu (blue) and C-terminal GTP EF-Tu (red) domains. The Hbs1-like N-terminal 
motif lies outside of the alignment window. (C) Fraction of nonessential and 
essential paralogs that show a high or low correlation of position-specific 
evolutionary rate patterns. The paralogs with low and high trigenic interaction 
fraction belong to the part of the distribution shown above; a trigenic interac- 
tion fraction cutoff of 0.4 was used, based on negative interactions (t or €) < 
-0.08, P < 0.05, and contains the set of paralogs that were used for the 
correlated evolution analysis. Significance was assessed by Fisher exact test. 
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simulations, however, ended with paralogs 
in a stable steady state in which no more mu- 
tations could be tolerated in either paralog, 
while still maintaining viability. Analysis of 
the simulation results revealed that the par- 
ticular trajectory followed by a given paralog 
pair was correlated with the level of functional 
entanglement. Specifically, paralog pairs that 
started with the highest levels of entangle- 
ment immediately upon duplication were more 
likely to revert to a singleton state. This sug- 
gests that duplicated genes generally cannot 
tolerate genetic perturbations when they lack 
functionally independent regions (Fig. 7B and 
fig. S9A). Among paralog pairs that were re- 
tained at steady state, increased entangle- 
ment at the point of duplication also led to a 
broader bias in the functional asymmetry (ra- 
tio of functional responsibilities) at steady 
state. Thus, paralogs diverge asymmetrically 
when they begin their evolutionary trajec- 


tory with a protein sequence containing ex- 
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tensive entanglement (Fig. 7C and fig. S9, B 
and C). Consistent with this observation from 
our simulations, paralog evolution can show 
asymmetric bias with respect to functional 
redundancy (Fig. 4B). 

Our modeling further revealed that as the 
extent of the initial entanglement of paralogs 
increased, so did the range of steady-state func- 
tional overlap, which is represented by con- 
strained domains at steady state (Fig.’7D and 
fig. S9D). This suggests that the bimodal dis- 
tribution of the trigenic interaction fraction 
(Fig. 4A) may indicate that one subset of WGD 
paralogs diverged substantially so that each 
of the sister paralogs has a distinct function, 
and another subset of retained WGD paralogs 
reached an evolutionary steady state despite 
retained functional overlap, perhaps as a re- 
sult of their structural and functional entan- 
glement (Fig. 7E). For example, in the case of 
SKI7-HBS1, Ski7 diverged from Hbs1 by losing 
the Hbs1-like N-terminal motif and the EF-Tu 
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C-terminal domain while retaining a highly 
diverged form of the EF-Tu GTP-binding do- 
main, reflecting a modular, structural, and 
functional organization of the protein (Fig. 6B). 
On the other hand, MRS3 and MRS4 encode 
mitochondrial carrier proteins dedicated to 
the transport of small inorganic ions, and thus 
their divergence would be predicted to occur 
in specific residues that modulate ion speci- 
ficity (Fig. 6B and fig. S7). 

We propose that the evolutionary fate of 
a duplicated gene can be governed by an 
interplay of structural and functional entan- 
glement (Fig. 7E). If a duplicated gene con- 
tains several easily partitioned functions, 
then it will most likely subfunctionalize; on 
the other hand, an entangled pair, which is 
highly restricted structurally and function- 
ally, would have a tendency to revert to a 
singleton state because one of the genes is 
predicted to quickly become nonfunctional. 
However, given multiple functions and an 
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Fig. 7. In silico evolutionary model. (A) Schematic depiction of the in silico 
evolutionary model. The pair evolves through random mutations until it reaches 
an evolutionarily stable state that can sustain no further mutations without a loss 
of function. Top, a pair at the start of the evolutionary trajectory; bottom, a 
pair that achieves a division of labor with a retention of a common function (dark 
blue blocks), the loss of which is prevented because it would compromise the 
unique functions of each paralog (yellow, light blue, red). (B to D 
fates of paralogs with functional and structural entanglement. Paralogs were 
generated to represent a range of overlapping functional domains at the onset of 
their evolutionary trajectory, and the propensity to assume specific paralog 
properties was quantified. In each case, the x axis represents bins of initial 
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functional overlap as a fraction of “gene” length at the start of the simulations 
(< 10%, 30%, 50%, 70%, 90%, 100%, respectively); the y axis depicts the 
propensities of paralogs to revert to a singleton state (B), evolve functional 
asymmetry (C), or retain functional overlap at the evolutionary steady state (D). 
(E) The structural and functional entanglement model of paralog divergence. A pair will 
evolve by subfunctionalization if it is modular and is composed of partitionable 
functions (left). A paralog pair that is very structurally and functionally entangled will 
have a high probability of reversion to a singleton state because one of the sisters 
will quickly degenerate (right). Paralogs with an intermediate level of entanglement at 
the time of duplication will tend to partition some and retain some overlapping 
functions, allowing for specialization of a common activity (center). 
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intermediate level of entanglement, a gene 
pair has a chance of partitioning or expand- 
ing some nonoverlapping functions while 
retaining others in common, and remaining 
evolutionarily stable. 


Discussion 


The complex genetic interaction network of 
dispensable WGD paralogs provides insight 
into the long-standing question of why paral- 
ogs with overlapping functions are retained on 
an evolutionary time scale. By combining both 
paralog-specific digenic interactions and the 
paralog pair trigenic interactions in a single 
metric, the trigenic interaction fraction, we 
captured the spectrum of the retained func- 
tional redundancy of dispensable paralogs. 

Definitions of functionally redundant or 
divergent paralogs using genetic interactions 
appear to be consistent with classification 
from protein-protein interaction studies (25). 
For example, in the case where a paralog is 
deleted and the sister responds by gaining 
specific protein-protein interactions, then the 
paralogs should compensate for each other’s 
loss and thus should exhibit a high trigenic 
interaction fraction. Indeed, four such paralogs 
were examined in our study and they showed 
a propensity to exhibit high trigenic inter- 
action fractions. In particular, NUP53-ASM4 
and OSH6-OSH7 showed high trigenic inter- 
action fractions of 0.49 and 0.80, respectively. 
On the other hand, some paralogs share protein- 
protein interactions that are lost for both 
paralogs when only one sister is deleted, which 
suggests that although these paralogs may 
cooperate, they do not fully compensate for 
each other (25). We examined two such known 
paralogs, PEX25-PEX27 and GSYI-GSY2, and 
they exhibited low trigenic interaction frac- 
tions of 0.19 and 0.25, respectively. Beyond 
these examples, genetic interaction profiling 
provides a functional readout and allows as- 
sessment of pairs of genes that do not have 
extensive protein-protein interaction profiles, 
and therefore it provides a complementary view 
of functional redundancy. 

Our framework enabled us to interrogate 
how WGD paralog evolution relates to the 
evolutionary stability of retained common func- 
tions and asymmetric functional divergence. 
By computing the extent of correlated evolution 
in sister paralogs (Fig. 6), we identified paralogs 
that show highly correlated position-specific 
evolutionary rate patterns and thus are under 
strong evolutionary constraints to retain some 
of their ancestral function, reflecting their 
structural and functional entanglement. This 
was further explored by our in silico model 
(Fig. 7), which predicts that low levels of en- 
tanglement are sufficient to drive asymmetric 
subfunctionalization, whereas more complex 
sequence-function relationships with higher 
structural entanglement can result in fixation 
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of functional redundancy. Indeed, our mod- 
eling shows that given some level of moder- 
ate structural entanglement and the potential 
for multifunctionality, a substantial fraction of 
duplicate pairs converge to a steady state in 
which they retain functional overlap. This re- 
sult offers a possible explanation as to the per- 
sistence of the functional overlap in paralogs, 
which is not simply due to paralogs diverging 
slowly from one another. We propose that the 
results of our in silico modeling may explain 
why the trigenic interaction fraction tends to 
follow a bimodal distribution (Figs. 4A and 7). 
The upper mode of the distribution repre- 
sents the set of duplicate pairs that will likely 
remain fixed in a partially functionally redun- 
dant state, whereas the lower mode represents 
duplicates that already have or are diverging 
in function. Because ohnologs and homeologs 
(42, 50) show the same distribution of trigenic 
interaction fraction, this model of paralog di- 
vergence and retention of functional redun- 
dancy also likely applies to gene duplicates of 
various ages and origins beyond WGD, which 
may include small-scale duplicates. 

In the simulation analysis, we modeled func- 
tions as being supported by contiguous se- 
quence domains. However, because our model 
treats every position along a “gene” as statis- 
tically independent, positions contributing 
to acommon function would not need to be 
contiguous, and the conclusions would remain 
the same for functional domains encoded by 
discontinuous sequences. Therefore, the model 
has the flexibility to capture a wide variety of 
different physiological scenarios that might 
display structural or functional entanglement, 
such as independent modular domains; lin- 
early distant contact sites within a secondary, 
tertiary, or quaternary structure; or even reg- 
ulatory regions beyond coding boundaries or 
elsewhere within the genome. It is important 
to note that this definition of structural and 
functional entanglement is distinct from the 
simple physical entanglement of proteins re- 
stricted to the basic organization of polypep- 
tide chains (63). 

The question of why subfunctionalization 
does not proceed to completion—leading to 
fixation of duplicated genes with some spe- 
cific functions, yet exhibiting a certain level of 
functional redundancy—remains an outstand- 
ing problem in evolutionary biology. Constraints 
that prevent complete divergence may allow 
paralogs to retain the ability to function in 
parallel biochemical pathways or macromolec- 
ular complexes, thus resulting in a retention 
of redundancy (18, 43, 64). More specifically, 
despite functional divergence of independent 
domains, incomplete subfunctionalization of 
paralogs could be driven by a structurally and 
functionally overlapping ancestral domain (35). 
There are few existing models of duplicate evo- 
lution that specifically address the existence 
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of redundancy in a steady state. There are 
models that address different potential modes 
of functional divergence, such as neo- or sub- 
functionalization (39, 65). However, reasons 
for the persistence of functional redundancy 
have remained elusive. It is noteworthy that 
previous simple computer simulations, which 
incorporated mutation rates of genes and 
the varying contribution of their functions to 
overall fitness, have also identified situations 
in which redundancy can be maintained in- 
definitely (66-68). It has been shown that 
paralogs that are selected to function as dis- 
tinct homomers also retain the ability to het- 
erodimerize, which may prevent functional 
divergence between paralogs (69). In general, 
our findings support fixation of overlapping 
functional redundancy for a substantial pro- 
portion of yeast paralogs. 


Methods summary 


To study the functional divergence of dupli- 
cated genes, 240 double mutants and 480 
corresponding single mutant control “query” 
strains, involving dispensable WGD pairs in 
the budding yeast Saccharomyces cerevisiae 
S288C, were constructed using PCR-mediated 
gene deletion followed by tetrad analysis. 
Paralog 1 deletion was marked with natMX4, 
while paralog 2 was deleted and replaced with 
K. lactis URA3. Single mutant control strains 
deleted for each one of the paralogs were also 
marked with the relevant control marker, which 
was inserted at the benign HO locus. Query 
strain fitness and query gene pair genetic in- 
teractions were measured using high-density 
synthetic genetic array (SGA) analysis. To ob- 
tain trigenic interactions, double mutant query 
strains along with their respective single mu- 
tant control query strains were subjected to 
trigenic-SGA analysis (t-SGA), which involves 
a number of automated replica pinning steps. 
Each query strain was mated to a diagnostic 
array of 1200 strains, consisting of deletion mu- 
tants of nonessential genes and temperature- 
sensitive alleles of essential genes, providing a 
representative view of the global digenic in- 
teraction network. Briefly, the query and ar- 
ray strains were mated on rich media, MATa/a 
diploids were selected on media containing 
G418 and clonNAT, sporulation was induced 
by transferring to media with low levels of 
nitrogen and carbon sources, and MATa meiotic 
haploid progeny was selected on haploid se- 
lection media. Triple mutants were then se- 
lected by first pinning onto haploid selection 
media containing G418, lacking uracil, and 
then onto haploid selection media containing 
both G418 and clonNAT, lacking uracil. Every 
query strain was screened in two independent 
replicates. 

Colony size was measured as a proxy for 
fitness, and digenic and trigenic interactions 
were scored using a quantitative model. Trigenic 
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interactions were classified into novel versus 
modified by overlapping with digenic inter- 
actions. Functional information embedded 
within digenic and trigenic interactions was 
assessed by their enrichment with external 
functional standards, such as protein-protein 
interactions, subcellular localization, coexpres- 
sion, and co-annotation. Trigenic interaction 
fraction was calculated as the ratio of the neg- 
ative trigenic interaction degree relative to the 
total negative digenic and trigenic interaction 
degree. Correlation of trigenic interaction 
fraction with physiological and evolutionary 
features included quantification of genetic 
interactions within a paralog pair, asymmetry 
of digenic interactions of members of each 
paralog pair, and sequence divergence rate, 
which was calculated as the raw difference be- 
tween the fold-changes in substitutions per site 
in post-WGD clades. The potential for paralog 
induction during developmental programs 
was assessed in (i) meiosis using published 
meiotic mRNA-seq and ribosome profiling 
datasets, (ii) filamentous growth using a pub- 
lished measure of invasion, as well as (iii) glu- 
cose starvation conditions using published gene 
expression dataset. Dosage selection was esti- 
mated using global digenic interaction profile 
correlation similarity. 

SAFE (Functional annotations based on the 
Spatial Analysis of Functional Enrichment) of 
the global genetic interaction profile similarity 
network was used to annotate gene function. 
Enrichment was calculated using the overlap 
of trigenic interactions with a neighborhood 
on the global digenic interaction similarity 
network. Novel paralog function for ECM13- 
YJR1I5W was interrogated using a drug sen- 
sitivity spot assay on media containing benomyl, 
and a liquid growth curve analysis on media 
containing latrunculin B. Spindle morphology 
was monitored by expressing Tub1-GFP, as 
well as sfGFP fusion proteins of Ecm13 and 
Yjrll5w and imaging the resulting strains using 
a spinning-disc confocal microscope. Novel 
paralog function for STB2-STB6 was monitored 
by Bap2-GFP localization in stb2A stb6A double 
mutant deletion strains and quantified using 
CellProfiler. 

To measure evolutionary constraints on paral- 
ogs, evolutionary rates for specific amino acid 
columns in multiple sequence alignments were 
computed using the discrete gamma model of 
protein evolution, as implemented in PAML for 
the pre-WGD sequences and for each paralog 
separately. Pearson correlation coefficients were 
computed between the rates of the pre-WGD 
clade to each paralog (pre-WGD & Paralog 1 and 
pre-WGD and Paralog 2), and between the two 
paralogs (Paralog 1 and Paralog 2) to classify 
paralogs into those with low and high corre- 
lation of position specific evolutionary rate 
patterns. BioGRID was used to curate genetic 
interactions for the mitochondrial carrier pro- 
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tein family. Paralog divergence was simulated 
using a computational framework in which a 
gene of fixed length was generated, annotated 
with hypothetical functions and subjected to 
random degenerative mutations at a constant 
rate. Evolution to a steady state was achieved 
when no more divergence mutations could be 
tolerated while maintaining viability. The result- 
ing paralogs were binned according to each 
pair’s initial level of structural entanglement, 
which is the level of mutable positions within 
a gene that carry out two or more functions to 
quantify the number of paralogs that reverted 
to singleton state, completely diverged or re- 
tained functional overlap. For a more detailed 
description of the experimental and compu- 
tational analyses, refer to the supplementary 
materials. 
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Muscle and neuronal guidepost-like cells facilitate 
planarian visual system regeneration 


M. Lucila Scimone*, Kutay D. Atabay*, Christopher T. Fincher, Ashley R. Bonneau, 


Dayan J. Li, Peter W. Reddient 


INTRODUCTION: Multiple strategies exist to pro- 
mote precise wiring of developing neuronal 
circuits. One strategy involves guidepost cells, 
which exist transiently in embryos. Guidepost 
cells can act as intermediate guidance tar- 
gets for axons or by providing a scaffold that 
facilitates axonal targeting. Most guidance 
mechanisms become dispensable once the 
circuit is assembled. Loss of guidance mech- 
anisms creates a potential limitation on re- 
generation of neuronal patterns—yet some 
animals are capable of functional regeneration 
of their nervous system. 


RATIONALE: Assuming some adult animals have 
the ability to regenerate functional neuronal 


circuits, they must possess mechanism(s) for 
de novo repair of neuronal patterns. In this 
study, we aimed to characterize such mech- 
anisms by studying regeneration of the pla- 
narian visual system after diverse injuries. 


RESULTS: We identified a rare subset of muscle 
cells (notum’*; frizzled 5/8-4") concentrated at 
two precise anatomical locations and in tight 
association with photoreceptor axons. The 
first group of these cells was found near the 
eye, where visual axons project and fasciculate 
to form a bundle. The second group of these 
cells was found near choice points, where sort- 
ing of contralateral and ipsilateral axons oc- 
curs. Both groups of muscle cells were formed 
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Adult guidepost-like cells facilitate visual system regeneration in planarians. Muscle and neuron 
guidepost-like cells are present at key locations near the planarian visual system and are formed independently 
of photoreceptor axons. Regenerating and transplanted eyes target projections to guidepost-like cells. 
Positional information cues provide an eye-extrinsic mechanism to place guidepost-like cells in the adult. 
Loss of guidepost-like cells is associated with visual system disruption. wnt-5, slit, and ndk are involved in 
positional control of guidepost-like cell placement. (25/8-4, frizzled 5/8-4. 
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during regeneration of the visual system and 
were always tightly associated with axonal 
projections, consistent with a possible role in 
attraction to facilitate visual system assembly. 
In addition, we found that a notum* set of 
neurons, located at the adult anterior brain 
commissure, regenerated before axonal mid- 
line crossing and was associated with optic 
chiasm regeneration. 

We reasoned that ifthe photoreceptor axon- 
associated notum’; frizzled 5/8-4* muscle cells 
have a guidepost-like function, their formation 

should be independent 
of eye cells. Eyes trans- 

planted to ectopic ana- 
Read the full article tomical locations did not 
at https://dx.doi. é 
org/10.1126/ result in the formation 
science.aba3203 of notum”; frizzled 5/8-4" 
muscle cells. Furthermore, 
animals that were unable 
to generate eyes [ovo RNA interference (ovo 
RNAi) animals] were still capable of specify- 
ing these muscle cells at the right locations. In 
addition, we predicted that if these muscle cells 
were indeed guidepost-like cells, visual axon 
trajectories should be associated with them 
after eye transplantation into eyeless heads. 
In all instances, axons from transplanted eyes 
projected toward notum’; frizzled 5/8- 4° mus- 
cle cells and often adjusted their trajectories 
after encountering them. 

We found that an array of signaling cues, 
which provide positional information essential 
for planarian patterning, was required for dic- 
tating the precise location of these guidepost- 
like cells. This provides a visual system-extrinsic 
mechanism for placing guidepost-like cells in 
the adult. 

Finally, with single-cell RNA sequencing 
and fluorescent in situ hybridization screen- 
ing, we identified molecules and transcription 
factors expressed in these cells. RNAi studies 
reduced or eliminated muscle or neuronal 
guidepost-like cell subsets and resulted in ab- 
errant patterns of visual axonal trajectories. 


CONCLUSION: Adult molecular and cellular 
strategies for regenerating neuronal pattern in 
the absence of embryo-specific contexts must 
exist to overcome damage or loss after injury. 
Understanding these mechanisms might pro- 
vide important insights for regenerative med- 
icine. Here, we found adult guidepost-like cell 
populations, extrinsic to the visual system and 
placed by adult positional information, that 
promote normal visual system regeneration 
in planarians. 


The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work. 
{Corresponding author. Email: reddien@wi.mit.edu 
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Muscle and neuronal guidepost-like cells facilitate 
planarian visual system regeneration 


M. Lucila Scimone’?**, Kutay D. Atabay??>*, Christopher T. Fincher>?, Ashley R. Bonneau’2%, 


Dayan J. Li:??, Peter W. Reddien??>+ 


Neuronal circuits damaged or lost after injury can be regenerated in some adult organisms, but the 
mechanisms enabling this process are largely unknown. We used the planarian Schmidtea mediterranea 
to study visual system regeneration after injury. We identify a rare population of muscle cells tightly 
associated with photoreceptor axons at stereotyped positions in both uninjured and regenerating 
animals. Together with a neuronal population, these cells promote de novo assembly of the visual system 
in diverse injury and eye transplantation contexts. These muscle guidepost-like cells are specified 
independently of eyes, and their position is defined by an extrinsic array of positional information 
cues. These findings provide a mechanism, involving adult formation of guidepost-like cells typically 
observed in embryos, for axon pattern restoration in regeneration. 


ormation of neural circuits during de- 

velopment requires the orchestration of 

multiple events, such as the specifica- 

tion of neurons, the precise navigation 

of axons through the extracellular envi- 
ronment toward targets, and the generation 
of specific synapses. The first axons to extend 
in developing nervous systems, known as pio- 
neer axons, grow along stereotyped routes and 
fasciculate with each other to form tracts that 
are followed by subsequent axons. Ablation of 
pioneer neurons can perturb the guidance of 
subsequent axons, causing delay or misrout- 
ing, although in most cases these axons still 
locate their final targets (1-4). 

Growing axons navigate with the assistance 
of cell-extrinsic guidance cues, such as Netrins 
and their DCC/UNC-40 or UNC-5 receptors; 
Slits and their Robo receptors; Semaphorins 
and their Plexin receptors; and Ephrins and 
their receptors (5-8). In addition, transient 
cell-cell interactions have important roles in 
neural circuit assembly. Such interactions in- 
volve cells referred to as guideposts. Guidepost 
cells, which were originally identified in the 
grasshopper limb (9, 10), are discrete, early- 
born, and transient specialized cell popula- 
tions located at decision or choice points along 
axonal trajectories (6, 17, 12). Most identified 
guidepost cells are either glia or neurons 
(11, 13-16). On encountering guidepost cells, 
axon growth cones can change their respon- 
siveness to extracellular guidance cues and 
modify their trajectory (6). In most systems, 
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positioning of guidepost cells at choice points 
along the axonal path is instructed by canon- 
ical axon guidance cues themselves (5-7), and 
the proper location of guidepost cells is fun- 
damental for precise axonal tract development 
(17-20). Since the discovery of guidepost cells 
in the grasshopper limb, guidepost cells in 
several other organisms have been identified, 
including midline glia cells in Drosophila, floor 
plate cells in the vertebrate spinal cord, and 
CD44* neurons and retinal glial cells found 
at the developing optic chiasm in mammals 
(21-24). 

The transient developmental programs that 
guide pioneer axons to their ultimate targets 
become dispensable once the neuronal circuit 
is assembled. In regenerative species, however, 
neuronal circuits that are lost to injury must 
reestablish themselves in the adult animal. 
This presents a puzzle: How can an adult neu- 
ronal circuit be formed de novo by regeneration 
in the absence of embryonic and potentially 
transient guidance mechanisms, such as guide- 
post cells? In some instances, damage to the 
nervous system can involve repair by new neu- 
ron production but is limited by the inability 
of new neurons to wire correctly (25). How- 
ever, in some other animals, regeneration of 
the nervous system seemingly restores normal 
function, although study of the regeneration 
of wiring patterns in highly regenerative mod- 
els is sparse. Animals that have the ability to 
regenerate their neuronal architecture must 
possess adult mechanism(s) for repairing cir- 
cuit patterns in the context of diverse injuries. 
To address how this can occur, we studied 
these mechanisms in the planarian Schmidtea 
mediterranea. 

Planarians are freshwater flatworms that 
belong to the Spiralia superphylum (26). They 
are capable of whole-body regeneration and 
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undergo constant tissue turnover. These pro- 
cesses involve neoblasts, a proliferating cell 
population containing pluripotent stem cells. 
In addition to dividing cells, proper regener- 
ation requires positional information. Genes 
proposed to encode positional information 
in planarians are predominantly expressed in 
muscle cells in a regionally restricted manner 
across body axes (27). Whereas the cells and 
molecules guiding the fate choices of stem 
cells into new differentiated cells in planarian 
regeneration have received substantial atten- 
tion, the mechanisms by which the architecture 
of the different planarian tissues is restored 
remains little studied. 

We identified a set of muscle cells and neu- 
rons tightly associated with the planarian pho- 
toreceptor axons with guidepost-like attributes. 
These cells were constitutively present and 
capable of being regenerated after injury. The 
positioning of these muscle cells was guided 
by a constitutive system of positional infor- 
mation enabling their replacement at suitable 
positions after injury to promote precision in 
the regeneration of visual system architecture. 
Our results describe a strategy for de novo for- 
mation of precise axonal projection pattern 
after injury in the adult. 


notum’*; frizzled 5/8-4* muscle cells are 
tightly associated with the visual system 


Many planarian species have a pair of true 
cerebral eyes that are symmetrically located 
dorsal to the cephalic ganglia (28). Planar- 
ian eyes contain rhabdomeric photoreceptor 
neurons that extend their axons ipsi- and con- 
tralaterally in axon bundles and pigmented 
cells that build an optic cup (Fig. 1A). The 
visual axons follow a stereotypic path that can 
be visualized using an anti-Arrestin mono- 
clonal antibody (28). Contralateral axons de- 
fasciculate from ipsilateral axons at choice 
points and project toward the midline, form- 
ing the optic chiasm. Both contralateral and 
ipsilateral axons project ventrally and posteri- 
orly to the visual targets in the brain (29, 30). 

Using fluorescence in situ hybridization 
(FISH) in combination with immunostaining 
with an anti-Arrestin antibody, we noticed a 
previously unknown and small population of 
cells that were in close association with the 
visual system, marked by expression of the 
genes notum and frizzled 5/8-4 (Fig. 1, B and 
C). We found that these cells were mostly 
concentrated in two regions: (i) near the eye, 
where the photoreceptor axons form a bun- 
dle projecting out from the eye, and (ii) near 
the axon choice points, located more ventrally 
(Fig. 1, B to D). The number of these notwm*; 
frizzled 5/8-4" cells found in each animal was 
variable but scaled with animal size (Fig. 1D). 
In a transverse cross section at the level of 
the eye, these cells were clearly distinguishable 
from other notum* cells known to exist in the 
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Fig. 1. notum’*; frizzled 5/8-4* A 
muscle cells are tightly 
associated with the planarian 
visual system. (A) Schematic 
diagram of planarian visual system. 
(B) notum’* cells associated with 
visual axons. Red box, location of 
image taken. (C and D) Coex- 
pression (C) and density map 
[(D), top] of notum and frizzled 
5/8-4 (f25/8-4) in cells associated 
with visual axons. Dark pink, 

cells near the eye; light pink, cells ¢ 
at choice points. The graph [(D), 

bottom] shows a positive correla- 
tion between the number of cells 
associated with the visual system 
and animal size (length). Red 
line, best fit for linear regression. 
R*. coefficient of determination. 
(E) Transverse cut (red line) shows 
nuclei position of three notum* 
cell subsets. Yellow dotted line, 
notum’; frizzled 5/8-4* cells. Dorsal, 
top; ventral, bottom. (F) SMEDWI-1 
presence in a cell associated with 
visual system indicates recent 
specification. (G to J) notu* cells 
associated with visual axons do not 
express eye-specific markers (G), 
neuron-specific markers (H), or glia- 
specific markers (I) but do express 
muscle-specific markers [(J); pool: 
troponin, tropomyosin, colF-2, 

colF-10]. (K) Muscle notum” cells 

are associated with visual axons in 

S. polychroa embryos. The cartoon 
shows embryonic stages. In (G) 
(0 (K), white arrows point to 
notum* cells associated with the 
visual system. Ratios represent the 
raction of double-positive cells 
(white) of total (magenta). When 
both numbers are in the same color, 
the ratio indicates the fraction of 
animals with that phenotype. Scale 
bars are 50 pum in (B), (C), (E), 
and (G) to (K) and 10 um for 

all magnified views in (C), (F), and 
(H) to (J). 


Axonal 
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planarian head, such as anterior pole cells (37) 
and notum* neurons (32) located at the an- 
terior commissure of the brain (Fig. IE and 
fig. SLA). The nuclei of the notum’*; frizzled 
5/8-4* cells found near the eye were located 
dorsal to the nuclei of the notum’*; frizzled 
5/8-4* cells found at the choice points, which 
were observed immediately dorsal to the 
cephalic ganglia (movie S1). To make this 
dorsal-ventral-location distinction clear, we 
have colored these two populations differ- 
ently in illustrations throughout this article. 
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The clear association of these cells at two 
discrete and important locations near axons 
of the visual system raised the possibility of a 
role for these previously unidentified cells in 
visual system wiring. 

Planarian eye cells, like all other cell types 
assessed in this organism, are replaced during 
normal tissue turnover. This process requires 
neoblasts, the only proliferating somatic cells 
in the animal (33). Neoblasts express the piwt 
homolog smedwi-1l. smedwi-1 transcription 
ceases as cells leave the neoblast state and 
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begin to differentiate, but the SMEDWI-1 pro- 
tein transiently perdures as cells differentiate. 
An antibody that recognizes SMEDWI-1 there- 
fore allows visualization of newly generated 
cells (34, 35). We observed a small number 
of notum*; frizzled 5/8-4* cells positive for 
SMEDWJI-1 protein, indicating that these cells 
are constantly specified and replaced in un- 
injured adult animals (Fig. 1F). 

To determine the identity of notum’; frizzled 
5/8-4* cells, we examined them for coexpres- 
sion of different cell-specific markers. These 


2 of 14 


RESEARCH | RESEARCH ARTICLE 


cells did not express the photoreceptor neu- 
ron marker opsin or the optic cup marker 
tyrosinase (Fig. 1G and fig. S1, B and C); the 
neuronal markers pc2, synapsin, or ChAT (Fig. 
1H and fig. S1D); or the planarian glia markers 
if-1 and cali (Fig. 11). Some fz5/8-4* ChAT" cells 
were found near the eye but did not express 
notum (fig. S1E). Instead, notum*; frizzled 
5/8-4" cells expressed muscle markers (Fig. 1J) 
and could occasionally be detected in cross 
sections colabeled with an antibody for pla- 
narian muscle (6G10; fig. S2A). Muscle cells 
are broadly distributed within the animal (fig. 
S2, A and B); however, notum’; frizzled 5/8-4" 
muscle cells were a rare subpopulation present 
in association with visual axons (movie S2). 
notum’; frizzled 5/8-4" cells expressed low lev- 
els of muscle markers (troponin, tropomyosin, 
colF-2, and mp-1; fig. S2, C to E) compared 
with other muscle cell subsets such as body 
wall, intestinal, and other dorsal-ventral mus- 
cle cells (36, 37), suggesting that these cells 
could be specialized muscle cells serving a 
different role. Muscle cells in general are 
known to secrete signaling factors in planar- 
ians (27). Despite lower levels of expression of 
muscle markers per cell, signal from pooled 
probes for canonical muscle markers, such as 
troponin and tropomyosin, was detected in 
essentially all of the notum’*; frizzled 5/8-4" 
cells (Fig. 1J and fig. S2F). Whether these cells 
retain contractile function is unknown. notum* 
muscle cells associated with the visual axons 
were also found during planarian embryonic 
development. Such notum* cells were first ob- 
served in prehatchling stage 7 S. polychroa 
embryos at the time of optic chiasm forma- 
tion, but not before (Fig. 1K). After hatching, 
all free-swimming juveniles displayed a simi- 
lar array of notum* muscle cells and visual 
axons, as did their adult counterparts (Fig. 
1K). The tight association of notum’; frizzled 
5/8-4° muscle cells with visual axons raised 
the possibility of a role for these cells in the 
wiring of the planarian photoreceptor axons. 


Regenerating visual axons project toward 
notum’*; frizzled 5/8-4* muscle cells 


We defined notum’; frizzled 5/8-4* muscle 
cells near the eye as NMEs (notum* muscle 
cells near the eye) and those at axon choice 
points as NMCs (notum* muscle cells at the 
choice point) (Fig. 2A). If NMEs and/or NMCs 
facilitate wiring of photoreceptor axons, they 
should be present before or concurrently with 
key axon-pathfinding processes and should 
also be tightly spatially associated with such 
events. To examine this possibility, we studied 
the regeneration of these cells and the dy- 
namics of photoreceptor axons after different 
regenerative challenges. 

First, we examined these cells in the context 
of a unilateral eye resection, which does not 
perturb the preexisting optic chiasm [Fig. 2B; 
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(38)]. This injury removed NMEs but did not 
remove NMCs (Fig. 2B). Within 2 to 4 days 
after eye resection, new photoreceptor neu- 
rons nucleated dorsally (regenerating the 
resected eye) and projected their axons ven- 
trally and posteriorly. NMC numbers stayed 
constant during this time (Fig. 2C). Despite 
variance in the relative positions of NMCs with 
respect to the new photoreceptor neurons and 
in the initial paths of projecting axons, the 
trajectories of photoreceptor axons essentially 
always coincided with the NMCs (Fig. 2, B and 
D, and fig. S3, A and B). This tight associa- 
tion between NMC position and axon path is 
consistent with the possibility that NMCs 
have an attractive influence on photorecep- 
tor axons. NMEs regenerated 4 days after 
eye resection, suggesting that NMEs are not 
necessary for early projections during eye re- 
generation after resection, a scenario where 
NMCs remained present throughout the re- 
pair process. 

We next studied the behavior of photore- 
ceptor axons, NMEs, and NMCs after head 
amputation. In this context, the animals need 
to regenerate the entire visual system, includ- 
ing NMEs and NMCs. Between days 2 and 3 
after decapitation, we observed nucleation 
of new eyes and the appearance of notum’*; 
frizzled 5/8-4* cells in close proximity to the 
regenerating eyes (Fig. 2E and fig. S3C). At this 
early time point, the regenerating blastema 
is small, making distinction between notum’; 
Srizzled 5/8-4 NMEs and NMCs not possible. 
These cells displayed detectable frizzled 5/8-4 
transcripts first, followed by notuwm expression 
(Fig. 2E). At this time, photoreceptor axons 
from the regenerating eyes projected toward 
where notum’; frizzled 5/8-4° cells were ob- 
served (Fig. 2, E and F, and fig. $3, A and C). 
The pattern of axonal projections and notum’; 
Srizzled 5/8-4* cell location was variable, with 
some axon tracts projecting anterior-medially 
and others posteriorly, suggesting that the 
initial steps in axonal pattern formation are 
noisy, but the spatial association of axons 
and notum’; frizzled 5/8-4* cells was apparent 
nonetheless. This is consistent with the possi- 
bility that in the context of de novo visual 
system formation, these cells have an attract- 
ive influence on early axonal projections. It is 
also possible that photoreceptor axons and 
notum’; frizzled 5/8-4* cells might stabilize 
each other on interaction. 

We noticed that the population of notum* 
neurons in the medial brain, normally located 
ventral to the optic chiasm in a fully formed 
brain, was in close association with photo- 
receptor axons during early head regener- 
ation. We refer to this third population of 
cells associated with photoreceptor axons 
during regeneration as notum* brain cells 
(NBCs) (Fig. 2A). NBCs were apparent within 
3 to 4 days after amputation at the regenerat- 
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ing anterior brain commissure [Fig. 2E; (32)]. 
Visual axons only projected to the midline, 
forming the optic chiasm, after NBCs appeared, 
and despite variation in their paths, they were 
typically associated with NBCs (Fig. 2, Eand G, 
and fig. $3, A and C). This process is reminis- 
cent of optic chiasm formation in mouse 
embryonic development, in which the site of 
the future optic chiasm is first populated by 
a subset of neurons expressing L1 and CD44. 
Subsequently, retinal ganglion axons grow 
and project toward the midline using these 
neurons as a landmark or a scaffold (24). Other 
cell types, such as radial glial cells at the optic 
chiasm, or midline cells in insects have a sim- 
ilar scaffolding role for axons crossing the 
midline during development (39). 

In planarian head regeneration, early pho- 
toreceptor axonal projections cross the midline 
[Fig. 2, E and G; (40)]. At the time of optic 
chiasm formation, the pattern of axonal pro- 
jections with choice points and notum’; frizzled 
5/8-4* cells near them became apparent (Fig. 
2E). We also observed axonal projections to 
the brain targets at this time. By day 4 of 
regeneration, the pattern of the photoreceptor 
axons was similar across animals, indicating 
that initial patterning during de novo regen- 
eration resolves with time into a stereotypical 
pattern. The events that occur can be ap- 
proximated into the following stages: (i) eye 
nucleation, (ii) NME and/or NMC regeneration, 
and axon bundle formation and projection 
toward NMEs and/or NMCs, (iii) NBC regener- 
ation, (iv) axonal projections toward NBCs and 
optic chiasm formation, and (v) axonal projec- 
tions toward the visual areas in the brain. The 
close association between photoreceptor axons 
and NMEs, NMCs, and NBCs in this process is 
consistent with the possibility that these cells 
have a function in facilitating visual system 
wiring. 


NMEs and NWMCs are specified 
independently of eyes 


If NMEs and NMCs have a guidepost-like func- 
tion, their formation should at least in part be 
independent of the photoreceptor neurons 
themselves. To examine this possibility, we 
transplanted wild-type eyes (41) at different 
positions in prepharyngeal and postpharyngeal 
animal regions. We did not observe any cells 
coexpressing notum and frizzled 5/8-4 near 
the transplanted eyes 10 days after transplan- 
tation (Fig. 3A and fig. S4A). Interestingly, we 
observed that axon bundles from transplanted 
eyes were commonly defasciculated (Fig. 3A 
and fig. S4A). These findings suggest that eyes 
in ectopic locations are insufficient to induce 
notum’; frizzled 5/8-4* muscle cells. 
Planarian eye cells (both photoreceptor neu- 
rons and optic cup cells) are specified by the 
transcription factor ovo (42). ovo RNA inter- 
ference (RNAi) animals cannot replace eye 
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Fig. 2. notum*; frizzled 
5/8-4* muscle cells are 
associated with regenerat- 
ing visual axons. (A) Illustra- 
tions show NME, NMC, and 
NBC positions across different 
axes. (B) Regeneration of 
visual system at different time 
points after unilateral eye 
resection. The bottom images 
are magnified views of visual 
axonal projection examples 

(1 to 4) 2 to 4 days after eye 
resection. Dotted outline, 
NMEs and NMCs. The left 
cartoon shows the surgical 
procedure: head decapitation 
(dotted line; day -3) and 
unilateral eye resection 3 days 
later [day 0 (dO); red line]. 
Red box, location of image 

taken. The cartoons on 

the right summarize events 
observed after eye resection. 

(C) There is no change in 

NMC numbers between 

resected and control sides. 

(D, F, and G) Circular plots 

show tracing of photoreceptor F 
axonal trajectories (lines) 

from independent right eyes 
during regeneration (d2 to d4) 

of a resected eye (D) or 
decapitation [(F) and (G)]. 

Colored dots represent NMCs 

(D), NMEs and NMCs (F), 

or NBCs (G). (E) Regenerating 
visual system after decapita- 

tion. The dotted line in 

the left cartoon indicates 

the amputation line, and the 

red box shows the location 

of the image taken. Illustra- 

tions (right) summarize 
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events observed after decapitation. Blue arrows, axons; white arrows, NMEs and/or NMCs; dark pink arrows or dots, NMEs; light pink arrows or dots and 


black arrows, NMCs. Orange dots or dotted outline, NBCs; yellow arrows, NMEs and NMCs expressing on 


and (D) to (G). 


cells during normal tissue turnover or regen- 
erate eyes after eye resection or decapitation. 
Nonetheless, NMEs and NMCs were both pres- 
ent in uninjured ovo RNAi animals, including 
in animals completely lacking photoreceptor 
neurons (Fig. 3B). However, their numbers 
were significantly reduced compared with 
control animals (Fig. 3B), suggesting a po- 
tential role for photoreceptor axons in the 
maintenance of normal NME and NMC num- 
bers. Next, to determine whether NMEs and 
NMCs could regenerate independently of eye 
cells, we performed different injuries in ovo 
RNAi animals. We found that NMEs were able 
to regenerate after unilateral eye resection in 
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ovo RNAi animals, indicating that eye cells 
were not required for NME specification in 
this context (Fig. 3C). Furthermore, NMEs 
were still observed 30 days after double-eye 
resection in ovo RNAi animals, suggesting that 
these cells were still specified even when no 
axons were left (fig. S4B). Moreover, NMEs 
and NMCs also regenerated after decapitation 
of ovo RNAi animals (Fig. 3D and fig. $4, C and 
D), although the total number of NMEs and 
NMCs in this context was significantly lower 
compared with controls (Fig. 3D). In conclu- 
sion, both NMEs and NMCs were observed 
in uninjured or regenerating animals that 
completely lacked photoreceptor neurons and 


2020 


y frizzled 5/8-4. Scale bars are 50 um in (B) 


associated axons. Together, these data suggest 
that NMEs and NMCs can be formed in the 
absence of eyes, but their homeostatic num- 
bers are positively influenced by photorecep- 
tor axons. 


Axonal projections from transplanted eyes 
associate with NMEs and NMCs 


If visual circuit-associated muscle cells (NMEs 
and NMCs) have guidepost-like activity, we 
predicted that visual axonal projections should 
be correlated to their location not just during 
regeneration but also after eye transplanta- 
tion in the head. We performed unilateral eye 
transplantations at approximately the original 
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Fig. 3. Visual axons are 

not required for NME and NMC 
specification but are required 
for maintenance. (A) NMCs 

are observed associated with 
original visual axons but not with 
axons from transplanted eyes. 
The cartoon on the left shows the 
location of transplanted eyes. 

(B) NMEs and NMCs are present in 
ninjured animals with few or 

no visual axons (ovo RNAi). The 
density map at the top right shows 
E and NMC distributions in 

n idealized visual system cartoon. 
icates the number of 
nimals mapped. The graph 

t the bottom right shows NME 
MC numbers in uninjured 
animals. (© and D) Regener- 
of NMEs (C) and NMEs 

nd NMCs (D) in ovo RNAi animals 
fter eye resection (C) or head 
amputation (D). Mapping at the 
bottom left shows NME and 

C distributions in an idealized 
visual system cartoon. n indicates 
the number of animals mapped. 
The graph at the bottom right 
shows NME and NMC numbers. 
The red box shows the location of 
the image taken. Dark pink arrows 
or dots, NMEs; light pink arrows 
or dots, NMCs. Scale bars are 
100 um in (A) and 50 um in 

(B) to (D). 
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eye position in double-eye-resected ovo RNAi 
recipients (fig. S5A). As shown above (Fig. 3B), 
some NMEs and NMCs perdured in intact ovo 
RNAi animals, allowing assessment of cor- 
relation between the location of eye projec- 
tions and NMEs and NMCs. One to 3 days 
after double-eye resection was not sufficient 
time to eliminate all preexisting photoreceptor 
axons in ovo RNAi recipients (fig. S5, B and C). 
Axonal projections from all transplanted eyes 
at this time point were able to navigate and 
follow the stereotypical path (fig. S5, B and C). 
To eliminate a potential role for remaining 
axons in serving as a scaffold for transplanted 
axons, we transplanted eyes 10 to 12 days after 
double-eye resection of ovo RNAi recipients. 
Within this time window, 87.5% of the ovo 
RNAi recipients had no remnant axons left, 
although some NMEs and NMCs were present 
at the time of transplantation (dO in Fig. 4A). 
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Axons from all transplanted eyes were prone 
to defasciculation and errors but projected 
ipsilaterally and crossed the midline in a 
stereotypical manner (Fig. 4A and fig. S5D). 
We observed that some NMEs were cotrans- 
planted with the wild-type eye, explaining, 
at least in part, the increased number of ob- 
served NMEs at day 7 after transplantation 
(Fig. 4B). Importantly, in all instances, even 
when transplanted eyes were slightly mis- 
positioned, the visual axon tracts extended 
toward the NMCs present in recipient ani- 
mals and bifurcated at these choice points 
(magnified views in Fig. 4A and fig. S5D). 
Moreover, in some cases, we observed that 
axons locally deviated from the stereotypical 
route of the main axon bundle toward NMEs 
or NMCs that were present in the recipient 
animals (magnified views in Fig. 4A and fig. 
S5D). Trace mappings of axonal trajectories 
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showed that nearly all axons passed close by 
NMEs and NMCs (within two cell diameters), 
and many directly interacted with NMEs and 
NMCs (Fig. 4C and fig. S6, A and B). The lo- 
cations of eyes, axonal projections, and NMEs 
and NMCs were variable in these experiments, 
but tight association of projections with NMEs 
and NMCs was observed nonetheless. These 
results are consistent with the possibility that 
NMEs and NMCs might attract and/or stabilize 
visual axons. Moreover, our observations show 
that formation of an optic chiasm in adult pla- 
narians does not depend on interactions be- 
tween the axons coming from each eye, in 
contrast to vertebrates (23, 39). 


Extrinsic axial patterning cues control NME 
and NNC positioning 


If NMEs and NMCs facilitate a photoreceptor 
axonal projection pattern in regeneration, then 
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Fig. 4. NMEs and A 
NNCs facilitate visual 
system patterning 
after eye transplanta- 
tion. (A) Shown at 

the top, NMEs and 
NMCs are still present 
10 to 12 days after 
double-eye resection in 
ovo RNAi animals 
before a wild-type (wt) 
eye is transplanted 
(left). Complete 
(middle) or incomplete 
(right) recapitulation of 


Arrestin 


1- 8F ovo RNAi 
2- eye resection 
3- eye transplant 
(10-12 days after 
resection) 
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eight times (8F), double-eye resection was performed, and wild-type eyes were transplanted 10 to 12 days after eye resection. Animals were fixed just before (dO) 
or 7 days after (d7) transplantation. In the middle, cartoons show axonal tracings of the image shown above and the positions of NMEs, NMCs, and NBCs. The 
images at the bottom are magnified views (white dotted box) showing axonal projections from transplanted eyes in close association with NMEs and NMCs (black arrows). 
(B) NME and NMC numbers before and after transplantation. (©) Cartoons show examples from (A) (i to viii) of NME and NMC associations with visual axons. 
within a two-cell-diameter distance to the axonal tract are shown. Axonal tracing observed near 48 NMEs and 
NMCs after transplantation is shown at the bottom. Dark pink arrows or dots, NMEs; light pink arrows or dots, NMCs. Scale bars are 50 um in (A). 


Numbers of cells that contact an axonal tract or found 


the positioning of these muscle cells should 
be regulated, at least in part, independently 
from the visual system itself. That NMEs and 
NMCs can be regenerated independently of 
photoreceptor neurons is consistent with this 
possibility and suggests that an extrinsic mech- 
anism should exist to position these cells. Cor- 
rect positioning of guidepost cells is required 
for the precise wiring of neural circuits in other 
organisms. In most examined cases, canonical 
axon guidance cues (Netrins and DCC or UNC-5; 
Slits and Robo; Semaphorins and Plexins; and 
Ephrins and Ephrin receptors) also affect 
guidepost cell positioning (72). To examine the 
role of conserved guidance cues in NME and 
NMC positioning, we inhibited previously re- 
ported and newly identified planarian genes 
encoding guidance cue homologs with RNAi 
(fig. S7, A and B). Inhibition of these genes 
perturbed photoreceptor axonal projections 
[fig. S8, A and B, and (43-46)] but did not alter 
overall numbers or positions of NMEs and 
NMCs (fig. S8, A to C), except when planarian 
axial patterning was disrupted (see below). 
Positional information is essential during 
both planarian regeneration and normal tis- 
sue turnover to inform neoblasts and/or their 
progeny about the location of tissues that 
need to be replaced. In planarians, a variety 
of signaling molecules known as position con- 
trol genes (PCGs) have important roles in 


axial patterning and constitute adult posi- 
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tional information. PCGs are predominantly 
expressed in muscle cells along different pla- 
narian axes (27, 47), and inhibition of some 
PCGs results in notable patterning defects, in- 
cluding hypercephalized animals, animals with 
duplication of eyes or pharynges (33, 47-49). In- 
hibition of PCGs affected the pattern of NMEs 
and NMCs during normal tissue turnover 
(Fig. 5A) and regeneration (fig. S8A). 

The planarian medial-lateral (ML) axis is 
regulated by medial slit and lateral wnt5, with 
wnts restricting slit expression to the midline 
(50). Inhibition of slit or wnt5 affected the 
ML positioning of NMEs and NMCs in both 
uninjured and regenerating animals (Fig. 5A 
and fig. S8A). slit RNAi animals showed a shift 
of these cells toward the midline, where an 
ectopic or cyclopic eye was observed (Fig. 5A 
and fig. S8A). 7oboC RNAi resulted in a similar 
phenotype to that of slit RNAi, suggesting that 
roboC encodes a major Slit receptor (fig. S8, A 
and D). wntS RNAi animals showed increased 
numbers of NMEs and NMCs laterally, in con- 
junction with the appearance of ectopic lateral 
eyes (Fig. 5A). Regenerating wntS RNAi ani- 
mals did not form an optic chiasm, and NMEs 
and NMCs were displaced more laterally (fig. 
S8A). ror-I RNAi resulted in a phenotype sim- 
ilar to that of wntS RNAi animals during both 
regeneration and homeostasis (fig. S8, A and 
D), suggesting that ror] encodes the Wnt5 re- 
ceptor in this process. 
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RNAi of genes involved in the patterning 
of the anterior-posterior (AP) axis affected the 
AP positioning of NMEs and NMCs (Fig. 5A). 
notum RNAi causes anteriorization of brain 
and ectopic anterior eye appearance (32), and 
these animals showed ectopic anterior NMEs 
and NMCs. Because of the axial patterning 
role of notum, it was not possible to readily 
determine the function of notum within NME 
and NMCs. 

nou darake (ndk) RNAi animals develop 
ectopic posterior eyes (47, 48) and shift ante- 
rior PCG expression domains posteriorly (47). 
ndk RNAi caused ectopic posterior appearance 
of NMEs and NMCs. Major surgery leading 
to extensive PCG shifts in head fragments (41) 
also resulted in the appearance of ectopic 
NMEs and NMCs at positions consistent with 
the locations of ectopic de novo eyes (Fig. 5A). 
In PCG RNAi experiments, NMEs and NMCs 
were sometimes observed in ectopic locations 
before the nucleation of a new ectopic eye or 
the projection of a new axon bundle (fig. S9A). 
This is consistent with the possibility that PCG 
alteration itself leads to ectopic positioning 
of NMEs and NMCs as opposed to these cells 
being induced in ectopic locations only after 
ectopic eyes or axonal projections appeared. 

To further test this hypothesis, we generated 
posteriorized ndk RNAi animals. After allowing 
RNAi to wear off, these animals maintained all 
ectopic posterior eyes and posteriorly expanded 
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Fig. 5. Axial patterning genes are required for positioning of NMEs and NMCs. 
(A) Visual system and NMEs and NNCs in different uninjured RNAi conditions or 
ollowing surgery are shown on the left. The density map (top right) shows 

ME and NMC distributions in an idealized visual system 
the number of animals mapped. The heatmap (bottom right) shows numbers of 
MEs and NMCs in each quadrant. (B) Illustration (top) summarizing the 
experimental procedure. Rescaling of ndl-2 PCG expression, maintenance of 
posteriorized brain lobe in old tissue, and regeneration of normal size brain lobe in 
blastema are shown in the middle. Shown at the bottom, 
present near eyes located at the correct position after PCG rescaling (1) but not 
near ectopic posterior eyes (2) in an ndk RNAi animal. White arrows point to brain 


brain lobes, even though ndk function re- 
turns and the PCG map scales back to its nor- 
mal proportions (41). Sagittal amputations at 
this time result in animals regenerating a side 
with a new eye and brain lobe at the correct 
scale and anterior position, generating an 
asymmetrical body plan [Fig. 5B and (4/)]. 
Under these conditions, NMEs and NMCs only 
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illustration. n indicates 
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regenerated at the correct AP position with 
respect to the rescaled PCG domains, near 
the regenerating eye within the blastema. 
Within the uninjured old tissue side, which 
maintained a posteriorly expanded cephalic 
ganglion and ectopic eyes, NMEs and NMCs 
were also only observed near the eye that 
was found at the correct position based on 
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obes or neurons (cintillo*). (©) Mapping shows N 
idealized visual system cartoon of uninjured PCG RNAi animals in the presence 
(control) or absence (ovo RNAi) of eyes. n indicates the number of animals 
mapped. The heatmap shows the total number of 
quadrant. slit RNAi leads to medialization, notum RNAi leads to anteriorization, 
and ndk; nd/-4 RNAi leads to posteriorization of 
independently of eyes. (D) wnt5 RNAi animals show a lateralized NME and 

MC distribution, whereas slit RNAi animals show medialized cell distribution 
independently of visual axons during regeneration. Dotted lines in cartoons show 
amputation planes. Dark pink arrows or dots, N 
MCs. Scale bars are 50 um in (A) and 
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the rescaled PCG map. Posterior ectopic eyes 
did not maintain NMEs and NMCs (Fig. 5B 
and fig. S9B), unlike animals that were kept 
under zdk RNAi conditions (Fig. 5A). These 
results further indicate a role for PCGs, rather 
than ectopic eyes themselves, in instruct- 
ing the positioning of newly specified NMEs 
and NMCs. 
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and soxP-5 are required 
for specification of guide- 
post-like cells and precise 
wiring of the visual sys- 
tem. (A) Defasciculation of 
visual axons (white dotted 
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Because NMEs and NMCs can be formed in 
the absence of eyes (Fig. 3D), we inhibited both 
ovo and PCGs at the same time to determine 
whether the ectopic positioning of NMEs and 
NMCs is still observed in PCG RNAi conditions 
in the absence of eye formation (Fig. 5, C and 
D, and fig. S9, C and D). In both uninjured and 
regenerating RNAi animals, NMEs and NMCs 
were mispositioned regardless of the presence 
Gin control) or absence (in ovo RNAi) of new 
photoreceptor neurons. Taken together, our 
results indicate that PCGs constitute a posi- 
tional information system that instructs the 
positioning of NMEs and NMCs indepen- 
dently of eye cells, providing a mechanism to 
place cells with candidate guidepost-like func- 
tion in specific locations. 


NMCs are absent in tolloid RNAi animals 


Bmp signaling ligands have been shown to 
guide commissural axons of the spinal cord in 
vertebrates ventrally, through repulsion (57, 52). 
Inhibition of the planarian bmp4 homolog 
did not grossly affect the pathfinding of visual 
axons during regeneration (53). However, in- 
hibition of tolloid, a metalloproteinase that 
can cleave the Bmp inhibitor Chordin in ver- 
tebrates and Drosophila (54, 55), resulted in 
severe defects in the trajectory of photore- 
ceptor axons (53). In regenerating tolloid RNAi 
animals, axons were more defasciculated and 
were not able to cross the midline, resulting in 
disorganized posterior projections (Fig. 6, A 
and B, and fig. S10A). The midline cues slit and 
netrin-2 were normally expressed in these ani- 
mals (fig. SIOB). NMEs and NBCs were still 
present, but NMCs were completely absent 
in regenerating tolloid RNA animals (Fig. 6A). 
NMCs were also severely reduced in tolloid 
RNAi animals during eye regeneration after 
double-eye resection and during homeostasis 
(Fig. 6C and fig. S10C). Under all of these con- 
ditions, photoreceptor axons showed higher 
levels of defasciculation. 

We next transplanted eyes into tolloid; ovo 
RNAi recipients, which lack NMCs. Indepen- 
dently of the presence or absence of remain- 
ing axons (transplantation 3 or 10 days after 
double-eye resection), wild-type photoreceptor 
axons were not able to robustly project in a 
fasciculated bundle or cross the midline in 
tolloid; ovo RNAi recipients (Fig. 6D and fig. 
S10D). These results are consistent with the 
possibility that NMCs can attract and/or sta- 
bilize axons to the choice points and facilitate 
axon bundle formation. Similar results were 
observed when transplanting a wild-type eye 
into a netrin-1; netrin-2; ovo RNAi recipient or 
when transplanting an eye from a DCC RNAi 
animal into an ovo RNAi recipient (fig. SIOE). 
Metalloproteinases have been shown to cleave 
the DCC receptor (56), raising the possibility of 
direct or indirect interaction between Tolloid 
and DCC in planarians. The midline defect in 
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tolloid RNAi animals indicates that additional 
tolloid-dependent processes likely exist and 
affect midline axonal crossing. 


arrowhead is required for NBC specification 
and optic chiasm formation 


Specific ablation of CD44-expressing chiasm 
neurons in mice demonstrated a role for these 
cells in the formation of the optic chiasm during 
embryonic development (57). Cell-specific abla- 
tion strategies are not currently available in 
planarians. However, in some instances, RNAi 
of transcription factor-encoding genes in- 
volved in cell-fate specification can be used to 
deplete animals of specific cell populations 
(34, 42, 58, 59). Inhibition of arrowhead, which 
is expressed medially to the cephalic ganglia 
and encodes a Lim homeodomain protein, 
resulted in failed regeneration of the optic 
chiasm and defective axonal crossing at the 
midline (60). arrowhead RNAi did not affect 
the expression of normal midline guidance 
cues, such as slit (60) and netrin-2 (fig. S11A). 
Visual axonal projections did not overtly follow 
other axon bundles at the anterior commis- 
sure in regenerating arrowhead RNAi animals 
(fig. SIIB). We hypothesized that arrowhead 
might be required for the regeneration of 
NBCs at the anterior commissure and thereby 
affect midline crossing of visual axons. Indeed, 
NBCs expressed arrowhead both in intact 
and regenerating animals (Fig. 6E and fig. 
S11C), and arrowhead RNAi animals failed to 
regenerate NBCs (Fig. 6F). However, we can- 
not rule out the possibility that other cells 
specified by arrowhead also contribute to optic 
chiasm formation. Numbers of NMEs and 
NMCs were not affected in arrowhead RNAi 
animals during head regeneration (Fig. 6F) or 
after double-eye resection (fig. S11D). Intact 
arrowhead RNAi animals that lost all NBCs 
during normal tissue turnover were still able 
to properly rewire the visual system after 
double-eye resection (fig. S11D), suggesting 
that NBCs might be dispensable for mainte- 
nance of the optic chiasm. Importantly, visual 
axons after double-eye resection perdure for 
around 3 days, which gives sufficient time for 
new photoreceptors to regenerate and project 
their axons to the brain targets following the 
preexisting axonal tract. Alternatively, the 
photoreceptor axons could also be influenced 
by the existing brain commissure. In addition, 
RNAi animals with duplicated optic chiasms 
[notum RNAi animals; (32)] also had ectopic 
anterior brain commissures with a duplica- 
tion of NBCs (Fig. 6G). 


soxP-5 is required for NME and 
NNC specification 


To examine the transcriptomic signature of 
NMEs and NMCs, we performed single-cell 
RNA sequencing (scRNA-seq) using Drop-seq 
(61) on cells isolated from the region containing 
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the visual system in intact animals (Fig. 6H). 
Cells were clustered using Seurat into 28 clusters, 
each of which could be assigned to a certain 
tissue class on the basis of tissue-specific gene 
expression [(62); Fig. 6H]. We recovered eight 
notum’; frizzled5/8-4" cells that were negative 
for brain markers (pc2 and ChAT) and anterior 
pole markers (foxD) and that clustered together 
with muscle cells (Fig. 6H, fig. S12A, and table 
Sl). We performed single-cell differential ex- 
pression (SCDE) analysis with these cells, 
comparing their transcriptomes with the tran- 
scriptomes of other muscle cells, and identi- 
fied enriched transcripts (table S2). A similar 
analysis was also performed for NBCs (table 
$3). One of the few transcription factors ex- 
pressed in NMEs and NMCs in the scRNA- 
seq data was soxP-5. (soxP-5 expression was 
difficult to detect by FISH.) sowP-5 was also 
broadly expressed in muscle cells, including 
muscle progenitors, but not in photoreceptor 
neurons or other cell types (Fig. 6H, fig. S12A, 
and table S2). 

RNAi of soxP-5 led to a significant decrease 
in the numbers of both NMEs and NMCs in 
uninjured (Fig. 61) or double-eye-resected ani- 
mals (fig. S12B), indicating that it is required 
for production of normal numbers of these 
specialized muscle cells. After head amputa- 
tion, soxP-5 RNAi animals regenerated head 
blastemas that were normal in macroscopic 
size, appearance, and brain architecture (fig. 
$12C). However, these blastemas displayed 
lower numbers of NMEs and NMCs—in most 
soxP-5 RNAi animals, multiple NMEs and 
NMCs did regenerate, but a minority of ani- 
mals had very few or no detectable NMEs and 
NMCs (Fig. 6J). In eye-resected sowP-5 RNAi 
animals, the axonal projection pattern remained 
mostly unaffected, as expected, because new 
axons could follow existing axonal tracts in 
their development (fig. S12B). In regeneration 
after amputation, the visual system must form 
de novo, allowing assessment of pattern in ani- 
mals with reduced NMEs and NMCs. Whereas 
most soxP-5 RNAi animals regenerated the 
stereotypical axonal projection pattern, around 
20% of the soxP-5 RNAi animals exhibited 
defects in this pattern (labeled as soxP-5* in 
Fig. 6, K and L, and fig. S12, D to F). sowP-5* 
RNAi animals showed ectopic axon bundles 
at the optic chiasm, abnormal fasciculation, 
and misrouting of the axon bundles at the 
choice points (Fig. 6, K and L, and fig. S12, D 
and E). We found that the soxP-5* animals were 
those that had a severe reduction in NMEs and 
NMCs: soxP-5* RNAi animals had significantly 
fewer NMEs (p = 0.0029) and NMCs (p = 0.02) 
than did soxP-5 RNAi animals with a relatively 
normal axonal projection pattern (fig. S12F). 
Normal numbers of muscle cells were observed 
in intact (fig. S12B) and regenerating sowP-5* 
RNAi animals (fig. S13, A and B), indicating 
that soxP-5 is not essential for general muscle 
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regeneration or maintenance per se. Further- 
more, NBCs, as well as other neuronal subsets, 
regenerated normally in sowP-5* RNAi ani- 
mals, indicating a specific effect on the muscle 
cells associated with the photoreceptor axons 
(fig. S13, C and D). The correspondence be- 
tween low numbers or absence of NMEs and 
NMCs and the quality of the regenerated axo- 
nal projections provides further evidence for a 
guidepost-like role for these muscle cells. Our 
results suggest that NME, NMC, and NBC 
guidepost-like cells, together with constitutive 
guidance cues expressed broadly in the ani- 
mal, contribute to the precise overall assembly 
and pattern of the visual system. 


Discussion 


Orchestration of precisely wired neural cir- 
cuits relies on multiple developmental pro- 
cesses. Axon pathfinding is a highly precise 
process, and conserved mechanisms of guid- 
ance are used by different organisms. Guidance 
mechanisms can act in concert and include 
long-range or local chemoattractants and che- 
morepellents (guidance cues), transient cell-cell 
interactions (guidepost cells), and fasciculation 
of axons with pioneer or preceding axon tracts 
that form an initial scaffold. Once the circuit is 
established, some of these mechanisms become 
dispensable for the maintenance of pattern. 
The transient nature of these developmental 
mechanisms creates a problem for repair after 
injury: If an injury removes a patterned cir- 
cuit, the information for bringing the circuit 
back would be lost and might result in irrep- 
arable damage to the nervous system, even if 
the mechanisms for new neuron production 
existed. However, functional regeneration of 
the nervous system in some animals suggests 
that mechanisms for de novo restoration of 
the precise pattern of neuronal circuits in 
adults must exist. 

We identified a population of muscle cells 
expressing notum and frizzled 5/8-4 located in 
close proximity to axon tracts at key locations 
in the planarian visual system. The tight asso- 
ciation and precise positioning of these mus- 
cle cells suggest that they have a role in the 
assembly of the planarian visual system. This 
finding led us to investigate the possible exis- 
tence of guidepost-like cells in nervous system 
regeneration. Guidepost cells in other orga- 
nisms are often transient neurons or glia. These 
planarian guidepost-like cells were not only 
present in regeneration but also constitutively 
maintained in the adult. It is atypical for a 
muscle cell to exert a guidepost-like role. How- 
ever, planarian muscle cells are not only con- 
tractile, they also secrete extracellular matrix 
proteins (63) and harbor positional informa- 
tion required for regeneration and tissue turn- 
over (27, #7). In addition, specialized subsets 
of muscle cells serve as anterior (64) and pos- 
terior organizers (65, 66). Our findings suggest 
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that non-neuronal connective tissue-like cells 
can promote neuronal circuit formation in 
regeneration. 

Our results suggest that notum*; frigzled 
5/8-4* muscle cells can locally attract and/or 
stabilize photoreceptor axons, facilitating for- 
mation of visual axon bundles and possibly 
helping with axon sorting at choice points 
(Fig. 6M). Axon-axon interactions are critical 
for organizing axons into bundles and are 
required for proper topographic pathfinding. 
The mechanisms involved in axon bundle for- 
mation described in other organisms include 
either interactions of adhesion molecules ex- 
pressed on the axons or repulsive interactions 
between axons and surrounding tissues [re- 
viewed in (67)]. The molecules required in 
planarians to facilitate this process remain 
unknown, although members of the immuno- 
globulin or cadherin superfamilies are appeal- 
ing candidates to play roles in this process. 
Guidepost cells in other systems exert their 
function in guiding pioneer axon tracts by 
secreting attractive or repulsive guidance cues. 
Inhibition of several genes encoding homologs 
of canonical guidance cues did not result in 
defective attraction of the visual axons toward 
the planarian guidepost-like cells, suggesting 
that different molecules are at play or that there 
exists some redundancy within the system. Our 
findings suggest a model in which guidepost- 
like cells act in concert with constitutively 
active global axon guidance cues to facilitate 
precision in the regeneration of the visual 
system. 

Our study also identified two different tran- 
scription factors required for the specification 
of distinct subsets of cells with guidepost-like 
attributes. arrowhead, a Lim domain homeo- 
box gene, is involved in the specification of 
NBCs, whereas soxP-5 is required for the 
specification of both NMEs and NMCs. NMEs 
and NMCs show differences not only regard- 
ing their position but also, based on our ob- 
servations from tolloid RNAi animals, in the 
expression of different programs to exert their 
function. NMCs were sensitive to tolloid inhi- 
bition, whereas NMEs remained unaffected. 
Some of these genes likely regulate systems 
beyond the NMEs and NMCs. Previously iden- 
tified targets of the Tolloid protease include 
the BMP inhibitor Chordin, a homolog for 
which has not been found in planarians. 
However, increasing evidence suggests that 
members of the Astacin-like family of metallo- 
proteases can also process extracellular matrix 
components and certain TGFB-family ligands, 
and they are likely candidates for modulat- 
ing the activity of guidance cues or their re- 
ceptors (56). 

How intermediate target or guidepost cells 
in different organisms are specified at precise 
locations has not been fully elucidated. Our 
observations suggest a model in which muscle 


26 June 2020 


guidepost-like cells are coarsely specified in a 
permissive region proximal to the eyes, which 
is, at least in part, defined by an axial PCG ex- 
pression map (Fig. 6M). Density maps for the 
muscle guidepost-like cells suggest that these 
cells might be maintained only at the positions 
where photoreceptor axons are present, as 
indicated by their tight association with pho- 
toreceptor axons. This feature may allow dy- 
namic maintenance of muscle guidepost-like 
cells in precise locations and allow regener- 
ation of these cells in the face of an unlimited 
array of regenerative challenges. 

Strategies for neuronal patterning in the 
absence of embryo-specific contexts likely will 
also be crucial to overcome challenges in re- 
generative medicine. By studying a naturally 
occurring context of de novo formation of a 
neural system in regeneration, we found that 
small populations of cells, extrinsic to the sys- 
tem itself, provide information to help pre- 
cisely pattern and assemble axonal projections 
in regeneration. Our results expand the diver- 
sity of strategies used by different organisms 
to organize and properly build the nervous sys- 
tem, especially in the context of injury repair. 


Materials and methods 
Animal husbandry 


S. mediterranea clonal asexual strain CIW4 ani- 
mals, starved for 7 to 14 days before experimen- 
tation, were used for all experiments except for 
in situ hybridizations in embryos (Fig. 1). All 
animals used were healthy, not previously 
used in other procedures, and were of wild- 
type genotype. Animals were cultured in plas- 
tic containers or petri dishes for experiments, in 
1x Montjuic water (1.6 mmol/liter NaCl, 
1.0 mmol/liter CaCl., 1.0 mmol/liter MgSO,, 
0.1 mmol/liter MgCly, 0.1 mmol/liter KCl, and 
1.2 mmol/liter NaHCO; prepared in Milli-Q 
water) at 20°C in the dark. Animals were fed 
blended calf liver. S. polychroa lines were gen- 
erated by amputation of a starter animal, and 
the line was initially propagated through suc- 
cessive rounds of amputation. After generat- 
ing a large colony, sexual reproduction ensued. 
Animals were fed homogenized calf liver once a 
week and cleaned twice weekly. Animals were 
kept in the dark at 20°C and maintained in 
1x Montjuic water. Egg capsules were collected 
and staged daily (68). 


Replication, size estimation, and randomization 


At least two independent FISH and immuno- 
staining experiments with a minimum of three 
animals per experiment were performed for 
the characterization of muscle guidepost cells 
in intact and regenerating adults and in em- 
bryos. For RNAi phenotype characterization, 
numbers of animals used in each staining are 
indicated in each panel. No sample size esti- 
mation was performed. Animals for all experi- 
ments were randomly selected from a large 
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collection of clonal animals. All animals have 
been included in statistical analyses, and no 
exclusions have been done. Images were ran- 
domized before quantification. 


Gene nomenclature 


Genes that encode proteins with a clear do- 
main structure have been assigned a name 
accordingly but have also been identified 
using a transcriptome contig id number to 
facilitate identification (figs. S7 and S8). 


Drop-seq clustering and SCDE analysis 


To obtain transcriptomes for NMEs and NMCs, 
data from targeted single-cell sequencing of the 
planarian brain [(62), GEO accession number 
GSE111764, BrainClusteringDigitalExpression- 
Matrix.dge.txt.gz] was used to identify cells 
positive for expression of notum (dd_Smed_ 
v4_24180_0_1) and f5/8-4 (dd_Smed_v4_ 
11823_0_1) and negative for expression of 
the anterior pole marker foxD (dd_Smed_v4_ 
23249 _0_1) and the neuronal markers ChAT 
(dd_Smed_v4_6208_0_1) and pc2 (dd_Smed_ 
v4_1566_0_1) (log-scale expression of 0.5 for 
all genes). To identify these cells within the 
single-cell sequencing data, the Seurat func- 
tion WhichCells [subset.name=contig ID, 
accept.low=0.5] was used with Seurat pack- 
age, v2.2 (69). Fifteen cells satisfied all gene 
expression thresholds (Cells_Head1_AAG- 
TCTCACGCC, Cells_Head1_CAGACCTTCCCC, 
Cells_Head1_CGTGACTAAGAA, Cells Head1_ 
GGCGTGGTGACN, Cells_Head1_TAAATTC- 
GATAG, Cells_Head1_TTTACTTTCGAT, Cells_ 
Head2_AACGCCATTTCC, Cells_Head2_AG- 
TATGAATATG, Cells_Head2_AGTCACTAA- 
CAA, Cells _Head2_CACCGTGTACTA, Cells_ 
Head2_CCAGATAACGCA, Cells _Head2_CGT- 
AACTATCGT, Cells_Head2_GGTTACAGCTTT, 
Cells_Head2_GTTCCATGAAGN, Cells_Head2_ 
TGCTGTGCATCT). Of these 15 cells, eight cells 
were assigned a muscle identity in (62) (Cells_ 
Head2_AACGCCATTTCC, Cells_Head2_AGTA- 
TGAATATG, Cells_Head2_AGTCACTAACAA, 
Cells_Head2_CACCGTGTACTA, Cells_Head2_ 
CCAGATAACGCA, Cells_Head2_CGTAACTA- 
TCGT, Cells_Head2_GGTTACAGCTTT, Cells_ 
Head2_TGCTGTGCATCT). Additional data on 
identified cells can be found in table S1 of (62). 
An expression matrix for all cells identified 
as muscle in the targeted brain sequencing 
from (62) was generated, tagging the eight 
muscle cells above as positive and the remain- 
ing 522 muscle cells as negative. This expres- 
sion matrix was used as input for the R package 
SCDE (70). SCDE analysis revealed a list of 
genes enriched in these putative guidepost 
cells. Genes were ranked by their “conservative 
expression” value. To obtain transcriptomes for 
notum* brain neurons, cells identified as neu- 
ral from the targeted brain sequencing were 
combined with all cells identified as neural in 
the principal single-cell sequencing from (62). 
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Cells positive for expression of both ChAT (log- 
scale expression of 2.5) and notum (log-scale 
expression of 2) were identified, as above. 
Thirteen cells satisfied both gene expres- 
sion thresholds (Cells_Head_TCATCCACGCTT, 
Cells_Head_ACAATGTTTGGT, Cells _Head_ 
GTCAGACTCAGN, Cells_Head_CCTGTCGGC- 
TCN, Cells _Head_AGGGCGTAGTAA, Cells_ 
Head2_GTCGTCGTTGCG, Cells_Head2_GGC- 
CCGAGGATG, Cells_Whole_CGAACCAATAGT, 
Cells_Pharynx_AGTACAATGTGN, Cells Head2_ 
AAACCAAGCCAG, Cells _Head2_AGTTAGGCA- 
CAN, Cells_Pharynx_GGTAGGTTATCG, Cells_ 
Pharynx_AAGTAGGCATCG). Of these 13 cells, 
three had been isolated from the planarian 
pharynx (Cells_Pharynx_AGTACAATGTGN, 
Cells_Pharynx_GGTAGGTTATCG, Cells_ 
Pharynx_AAGTAGGCATCG) and were thus 
discarded. Additional data on identified cells 
can be found in table S1 of (62). An expression 
matrix for all ChAT* neurons (2103 cells) was 
generated, tagging the name of each neuron as 
positive or negative for expression of notum. 
This expression matrix was used as input for 
analysis by SCDE, as above, revealing a list 
of genes enriched in notum* neurons. Genes 
were ranked by their “conservative expres- 
sion” value. 

For all cells analyzed in table S1, unique 
molecular identifier (UMI) counts were log 
normalized by dividing by the total number of 
UMIs per cell, then multiplying by 10,000. All 
calculations were performed in log space [i.e., 
In(UMIs-per-10,000+1)]. The 50 transcripts with 
the highest average normalized expression in 
the eight putative guidepost-like cells were then 
identified, and their expression was compared 
with 10 randomly chosen non-guidepost muscle 
cells (chosen from cluster 9 of the Drop-seq 
data), 10 randomly chosen ciliated epidermis 
cells (chosen from cluster 19 of the Drop-seq 
data), and 10 randomly chosen neural cells 
(chosen from clusters 0, 1, 5, 17, 20, 23, 24, 25, 
26, 27, and 28 of the Drop-seq data). Averages 
indicate the average normalized UMI counts 
for each group of cells. Muscle enrichment and 
broad expression determinations were made 
using the online resource digiworm.wi.mit.edu 
from (62). Namely, genes enriched in clusters 7, 
13, 14, or 16 of the main clustering data from 
(62), but not broadly expressed across most 
clusters, were designated muscle enriched. All 
of the top 50 transcripts expressed in the puta- 
tive guidepost cells were either muscle enriched 
or broadly expressed, including in muscle. 


Gene cloning 


Homologs of guidance cues and transcription 
factors in planarians were cloned using the 
following primers: 
roboC (dd_Smed_v4:_'7921_0_1), fwd: 5’ atgg- 
tgccattgtcccgg; rv: 5’ aaccgagagttgccegste; 
roboD (dd_Smed_v4_14150_0_1), fwd: 5’ tgct- 
caatcgtcagataccg; rv: 5’ accgggaattcgaaaagact 


26 June 2020 


unc-5A (dd_Smed_v4_10380_0_1), fwd: 5’ 
ttgctcctagcgetcttcat; rv: 5’ tgtacgceggaattgctactg 
unc-5B (dd_Smed_v4_10585_0_1), fwd: 5’ 
tcttgagccacaaccctttt; rv: 5’ ccagttcgatatccgaagga 
unc-5C (dd_Smed_v4_10730_0_1), fwd: 5’cc- 
aactcgggaaattgaaga; rv: 5’ ccgaaacaaaagetggagaa 
unc-5D (dd_Smed_v4_16435_0_1), fwd: 5’ cc- 
ctcaaggaacaaaatgga; rv: 5’ aaatttcccaatcgggtttc 
ephR-1 (dd_Smed_v4_16483_0_1), fwd: 5’ 
ecgatcacttttcagccaat; rv: 5’ gtgaggttgectgattccat 
ephR-2 (dd_Smed_v4_16928_0_1), fwd: 5’ 
gttcctctgatgtgcccagt; rv: 5’ agatccggcatgaatctgac 
ephrin (dd_Smed_v4:_16552_0_1), fwd: 5’ teca- 
gcaagatatgccgata; rv: 5’ tectgaaaaactgataattgaaaca. 
ephrin (dd_Smed_v4_10687_0_1), fwd: 5’agt- 
cattcacggtccaggc; rv: 5’ cccatgaaaaacaggttcaaaag 
arrowhead (dd_Smed_v4_4:7123_0_1), fwd: 
5’ gttgcaaagctagctcaatttca; rv: 5’ acgggatatggatt- 
aacttgaca 

plexin (dd_Smed_v4_11934_0_1), fwd: 5’ gc- 
gagtgtggtegeaaaaat; rv: 5’ ccaagacgcaccaagaacaa 

semaphorin-1(dd_Smed_v4_12018_0_1), 
fwd: 5’acctgaaatccctttcactcgt; rv: 5’ tecagctgtg- 
taagagagga 

soxP-5 (dd_Smed_v4_9050_0_1) accession 
number: GenBank JX010525.1. 

All constructs were cloned from cDNA 
into the pGEM vector (Promega). These con- 
structs were used to synthesize RNA probes 
and double-stranded RNA (dsRNA) for RNAi 
experiments. 


RNAi 


For RNAi experiments, dsRNA was synthesized 
by in vitro transcription reactions (Promega) 
using polymerase chain reaction (PCR)- 
generated templates with flanking T7 pro- 
moters, followed by ethanol precipitation, 
and annealed after resuspension in water. 
The concentration of dsRNA varied in each 
prep between 4 and 7 ug/ml. dsRNA was then 
mixed with planarian food (liver) (36), and 
2 wl of this mixture per animal (liver contain- 
ing dsRNA) was used for feedings. For the 
guidance cue screen (fig. S8), animals were fed 
six times in 3 weeks. For the PCG screen (Fig. 5 
and fig. S9), animals were fed between 6 and 
10 times in 3 to 5 weeks until phenotype was 
observed. ovo RNAi animals for homeostasis 
experiments (Figs. 3 and 5 and fig. S9) were 
fed 8 to 12 times during a 4- to 6-week period. 
ovo RNAi regeneration experiments (eye re- 
section and decapitation; Fig. 3) animals were 
fed eight times in 4 weeks. ovo RNAi ani- 
mals for transplantation experiments were 
fed eight times in 4 weeks (Fig. 4 and figs. S5 
and S6). ndk; ndl4 RNAi animals were fed 
until posterior eyes appeared (8 to 12 feedings). 
For RNAi “wear off’ experiments, animals were 
not fed for 2 months before a midline sagittal 
cut was performed (Fig. 5). arrowhead RNAi 
animals (Fig. 6 and fig. S11) were fed six times 
over a period of 3 weeks. tolloid RNAi ani- 
mals were fed only once for regeneration 
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experiments and twice for homeostasis ex- 
periments (Fig. 6 and fig. S10). sowP-5 RNAi 
animals were fed eight times in a 4-week pe- 
riod of time (Fig. 6 and figs. S12 and S13). All 
feedings were performed every 3 days. In all 
cases, animals were fixed 7 days after the last 
feeding. For regeneration experiments, ani- 
mals were amputated into three pieces (head, 
trunk, and tail pieces) 1 week after the last 
RNAi feeding. Seven days after amputation, 
trunk pieces were scored and fixed for further 
analysis. 


FISHs and immunostainings 


RNA probes were synthesized, and whole- 
mount FISH was performed (36). Briefly, ani- 
mals were killed in 5% N-acetyl cysteine (NAC) 
and treated with proteinase K (2 ug/ml). After 
overnight hybridizations, samples were washed 
twice in each of prehybridization buffer, 1:1 
prehybridization-2X SSC, 2X SSC, 0.2K SSC, 
and phosphate-buffered saline (PBS) with 
Triton-X (PBST). Subsequently, blocking was 
performed in 10% Western Blocking Reagent 
(Roche, 11921673001) PBST solution for di- 
goxigenin (DIG) probes, or in 5% horse serum 
and 5% casein for dinitrophenol (DNP) and 
fluorescein (FITC) probes. Antibody washes 
were then performed for 1 hour followed by 
tyramide development. Peroxidase inactiva- 
tion with 1% sodium azide was done for 90 min 
at room temperature. Brightfield images were 
taken with a Zeiss discovery microscope. Flu- 
orescent images were taken with a Leica SP8 
confocal microscope. Colocalization analyses 
of FISH signals were performed using Fiji/ 
ImageJ. Three-dimensional (3D) reconstruc- 
tion for the movies was performed using 
Imaris 3/4D Image Visualization and Anal- 
ysis software. For each channel, histograms of 
fluorescence intensity were used to determine 
the cut-off between signal and background. 
All FISH images shown are representative of 
all images taken in each condition and are 
maximal intensity projections. All images, 
unless otherwise indicated, are anterior up. 
For immunostainings with anti-Arrestin anti- 
body (VC-1) or anti-muscle antibody (6G10), 
animals were fixed as for in situ hybridiza- 
tions, blocked in 3% bovine serum albumin 
(BSA)-PBST or in 10% Western Blocking Re- 
agent (Roche, 11921673001) PBST solution, 
respectively, for 1 hour and then stained with 
the antibody of interest. The anti-muscle mouse 
monoclonal antibody 6G10 (RRID: AB_2619613) 
(71) was used in a 1:1000 dilution, the anti- 
Arrestin mouse monoclonal antibody VC-1 
was used in a 1:5000 dilution, the anti-Arrestin 
rabbit polyclonal antibody and the anti-a 
tubulin antibody (Lab Vision catalog no. MS- 
581, RRID: AB_144075) were used in a 1:500 
dilution, and anti-mouse Alexa conjugated 
antibodies (Life Tech) were used in a 1:500 
dilution. The pool of probes used for label- 
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ing muscle cells included troponin, tropo- 
myosin, colF-2, and colF-10 (Fig. 1 and fig. $2). 


S. polychroa (Spol) FISH 


Whole-mount FISH was performed as de- 
scribed above for S. mediterranea with the 
following modifications: (i) Stage 2 to 5 em- 
bryos were dissected out of egg capsules and 
fixed in 4% formaldehyde for 2 hours. Embryos 
were washed in PBST-0.5% for 10 min and 
dehydrated in 25, 50, 75, and 100% PBST: 
methanol for 10 min. Fixed embryos were not 
bleached and were stored at —20°C. Protein- 
ase K treatment was extended to 20 min. (ii) 
Stage 6 and 7 embryos were dissected out of 
egg capsules and fixed in 4% formaldehyde for 
1 hour. Embryos were washed in PBST-0.5% 
for 10 min and dehydrated in 25, 50, 75, and 
100% PBST:methanol for 10 min each. Em- 
bryos were washed in PBST-0.5% for 10 min 
and dehydrated in 25, 50, 75, and 100% PBST: 
methanol for 10 min. Fixed embryos were not 
bleached and were stored at —20°C. Protein- 
ase K treatment was extended to 15 min. (iii) 
Hatchlings were treated with 5% NAC PBS 
with gentle rotation for 5 min and then fixed 
in 4% formaldehyde for 1 hour. Hatchlings 
were then washed in PBST-0.5% for 10 min 
and dehydrated in 25, 50, 75, and 100% PBST: 
methanol for 10 min. Hatchlings were stored 
at -20°C and bleached in formamide bleach- 
ing solution for 1 hour at room temperature. 
Proteinase K treatment was extended to 12 min. 
All embryos were mounted in Vectashield 
between two coverslips and both sides were 
imaged on a Leica SP8 confocal microscope. 


Eye resections and eye transplantations 


To selectively resect eyes, animals were placed 
on moist filter paper on a cold block to limit 
movement, and the tip of a microsurgery blade 
was used to remove eyes. For eye transplants, 
after anesthesia using 0.2% chloretone in 
planarian H,O for 2 min, a thin slit cut was 
made to desired locations of the recipient 
animals and a small hole was generated by 
gently moving the surgical blade up and down 
within the slit cut. Recipient animals were 
washed in Holtfreter’s solution for 2 min and 
rested in 0.2% chloretone for 2 min, briefly 
washed in Holtfreter’s solution, and trans- 
ferred on the cold block to introduce the ex- 
cised eyes. Eye resections were performed using 
a dissecting microscope by trimming the pig- 
mented tissue around the eyes with a surgical 
blade, leaving only the white area and the 
visible optic cup of the eyes. The pigmented 
ventral side of this tissue was also trimmed 
away before transplantation. The eyes were 
gently pushed inside the previously gener- 
ated holes in the recipient animals. Trans- 
planted animals were immobilized using Type 
IV, 5% ultra-low melting agarose (Sigma) 
on top of WhatmanTM (GE Healthcare, Life 
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Sciences) filter paper. Solidified gel was cov- 
ered using Rasta Royale ultrathin rolling 
paper soaked in Holtfreter’s solution. Trans- 
planted animals were kept in 10°C overnight 
and recovered by cutting the gel around and 
also on top of the animal. Animals were 
placed in planarian H,O and kept at 22°C for 
recovery. 


Quantifications and statistical analysis 


Total numbers of NMEs and NMCs were 
counted based on expression of the markers 
notum and fz5/8-4, the location of their nu- 
clei on the dorsal-ventral axis, and by using, 
as a reference, the brain architecture with a 
4',6-diamidino-2-phenylindole (DAPI) nuclei 
staining. Total numbers of muscle guidepost 
cells were also counted and graphed relative 
to the total length of the animal expressed in 
micrometers. 

NME and NMC cell position and density 
maps were generated by building an ideal- 
ized planarian visual circuit and positioning 
it in relation to the brain commissure and the 
anterior pole determined by DAPI staining 
and notwm expression, respectively. This trace 
was placed on four (for ovo RNAi only) or nine 
(for all other conditions) equal zones in the 
Adobe Illustrator software and the approxi- 
mate positions of NMEs and NMCs in relation 
to the photoreceptor axons, brain commis- 
sures, and anterior poles was mapped manu- 
ally. Heatmaps indicating relative positions of 
NMEs and NMCs were generated by counting 
the NMEs and NMCs that fall into each of the 
nine zones. The range was determined by the 
highest and the lowest total number of cells 
that were counted in each zone. 

Axonal trajectories were manually traced in 
the Adobe Illustrator software after transfer- 
ring a maximum intensity projection of the 
image containing the planarian visual circuitry 
into a new document. NMEs and NMCs were 
mapped onto the traces based on their posi- 
tion in relation to the visual axons using FIJT’s 
channels tool, determining the dorsal-ventral 
position for each cell. To determine the sig- 
nificance of axonal interactions with NMEs, 
NMCs, and NBCs (Fig. 2), axons from 6 to 
10 separate eyes, for each group, were traced 
manually in the same manner described above. 
The trace maps were placed on a circular area 
that is divided into 36 equal sections (10° each). 
NME, NMC, and NBC cell distributions in 
reference to the center point of the circle (eye) 
were mapped, and angular median values and 
95% confidence intervals for cell positions 
were calculated for each group. NME, NMC, 
and NBC numbers and the number of axonal 
intersections in each pie shaped section shown 
in Fig. 2 were counted, and bar graphs were 
generated using GraphPad Prism software af- 
ter linearizing the circular arenas. Coincidence 
of NMEs, NMCs, and NBCs and axons was 
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represented by a black box around the area 
they all occupy (fig. S3). Local axonal trajecto- 
ries were traced in relation to NMEs and 
NMCs in “d7 after eye transplant” animals, 
and axons that extend within two NME or 
NMC cell diameters were mapped in circular 
fields. An overlay map of these circular fields 
was generated to build a density map of axons 
that project to the vicinity of NMEs and NMCs 
(Fig. 4 and fig. S6). 

One-way analysis of variance (ANOVA) test 
followed by Dunnett’s multiple comparison 
test was used when analyzing more than two 
conditions. Unpaired Student’s ¢ test was used 
when comparing two conditions. Mean + SD 
is shown in all graphs. 
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Flexible recruitment of memory-based choice 
representations by the human medial frontal cortex 


Juri Minxha, Ralph Adolphs, Stefano Fusi, Adam N. Mamelak, Ueli Rutishauser* 


INTRODUCTION: Decision-making in complex 
environments relies on flexibly combining 
stimulus representations with context, goals, 
and memories. A central component of cogni- 
tive flexibility is to selectively retrieve infor- 
mation from memory and utilize the retrieved 
information to make decisions. The medial 
frontal cortex (MFC) plays a critical role in 
this process by representing task sets, context, 
and outcomes. During decision-making, the 
MFC is thought to selectively engage mem- 
ory retrieval by representing memory-based 
choices and mediating interactions between 
the frontal lobes and the hippocampus and 
amygdala (HA) through phase-locking of MFC 
activity to oscillations in the HA. It remains 
unknown what features of decisions and con- 
text are represented in the human MFC and 
what functional interactions between the MFC 
and HA mediate dynamic memory retrieval 
during a task. 
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RATIONALE: We recorded single neurons and 
local field potentials in the human MFC and 
HA in patients implanted with depth elec- 
trodes. Subjects switched between two tasks: 
recognition memory and categorization. To 
identify signatures of task demands, we com- 
pared the strength of encoding of stimulus 
familiarity, category, and choices between tasks 
and tested whether decoders trained in one 
task generalized to the other task. Such cross- 
task generalizability would indicate abstract 
representations of the underlying variables. 
We hypothesized that this approach would 
reveal neural signatures of the representations 
and functional interactions that permit memory- 
based decisions. 


RESULTS: We recorded from 1430 single neurons 
in the HA and MFC [dorsal anterior cingulate 
cortex (dACC) and the pre-supplementary 
motor area (pre-SMA)] across 13 patients. 
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Flexible representations of choices in the human frontal lobe. (A) Recording locations. LFP, local field 
potential. (B) Population response of all recorded neurons (left) and example of a cell signaling memory- 
based choices (right). (© and D) Representational geometry analysis reveals that different subspaces are 
used by the two tasks, establishing a memory-specific decision axis. (E) Theta- and gamma-band coherence 
of MFC choice cells with HA LFPs increased during the memory task. 
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Subjects made “yes” or “no” decisions using 
button presses or saccades (eye movements) 
to indicate whether an image was novel or 
familiar, or whether an image belonged to a 
given visual category. Instructions were given 
before each block of trials, 
explaining the task and 
Read the full article | T@Sponse modality to use 
at https://dx.doi. (i.e., task set). Examining 
org/10.1126/ the underlying neural rep- 
science.aba3313 resentations at the single- 
neuron and population 
levels revealed the following: (i) Cells in the 
MFC represented task set during baseline pe- 
riods. These contextual signals emerged rapidly 
after a task switch and generalized across all 
response and task-type combinations in the 
MFC but not the HA. (ii) The strength and 
geometry of representations of familiarity 
were task-insensitive in the HA but not in the 
MEC. The responses of these memory-selective 
cells were a reflection of memory strength 
rather than decisions about the memory. 
Gii) The visual category of stimuli was repre- 
sented more strongly during the memory task in 
both the MFC and HA. This encoding of category 
generalized across tasks fully in the HA but 
not the MFC. (iv) Choices in both tasks were 
most strongly represented by cells in the 
MEC. This choice representation differed in its 
population-level geometry between the two 
tasks but was insensitive to response modality 
(button press or saccade). One subset of MFC 
cells signaled only memory-based choices, and 
these cells signaled decisions about the mem- 
ory. (v) MFC cells phase-locked their activity 
to theta-frequency band oscillation in the 
HA preferentially in the memory task, with 
memory-choice cells also phase-locking in the 
gamma-frequency band. The strength of this 
interareal phase-locking in both frequency 
bands of the MFC cells that signaled memory- 
based choices was predictive of behavior. 


CONCLUSION: We leveraged the opportunity 
to record from single neurons in humans to 
identify representations of choices, task sets, 
stimulus category, and familiarity in the human 
MFC and HA. We found that neuronal pop- 
ulations within the MFC formed two separate 
decision axes: one for memory-based decisions 
and another for categorization-based decisions. 
MFC-HA theta-frequency functional connect- 
ivity was selectively enhanced during memory 
retrieval. This work reveals a neuronal mecha- 
nism in the human brain whereby oscillation- 
mediated coordination of activity between distant 
brain regions and accompanying changes in 
strength of representation and/or geometry im- 
plements task-dependent retrieval of memory. ! 


The list of author affiliations is available in the full article online. 
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Flexible recruitment of memory-based choice 
representations by the human medial frontal cortex 


Juri Minxha??7, Ralph Adolphs?“, Stefano Fusi*, Adam N. Mamelak’, Ueli Rutishauser'2>->* 


Decision-making in complex environments relies on flexibly using prior experience. This process depends 
on the medial frontal cortex (MFC) and the medial temporal lobe, but it remains unknown how these 
structures implement selective memory retrieval. We recorded single neurons in the MFC, amygdala, and 
hippocampus while human subjects switched between making recognition memory-based and 
categorization-based decisions. The MFC rapidly implemented changing task demands by using different 
subspaces of neural activity and by representing the currently relevant task goal. Choices requiring 
memory retrieval selectively engaged phase-locking of MFC neurons to amygdala and hippocampus field 
potentials, thereby enabling the routing of memories. These findings reveal a mechanism for flexibly and 
selectively engaging memory retrieval and show that memory-based choices are preferentially 


represented in the frontal cortex when required. 


ehavior in complex environments re- 

quires decisions that flexibly combine 

stimulus representations with context, 

goals, and memory. Two key aspects of 

cognitive flexibility are the selective uti- 
lization of relevant information depending on 
task demands and the retrieval of information 
from memory, when needed (1). We are begin- 
ning to understand the neural mechanisms 
that underlie flexible decisions in the case of 
perceptual decision-making (2-4), with evi- 
dence for both early gating, mediated by top- 
down attention (5), and late selection of relevant 
features in the prefrontal cortex (3). In contrast, 
little is known about the decision mechanisms 
that also depend on associated category knowl- 
edge and memory. In particular, it is not clear 
how memory retrieval is selectively engaged 
when decision-relevant information needs to 
be actively searched for in memory (6-8). 

The medial frontal cortex (MFC) is critical 
for complex behavior and registers cognitive 
conflict, errors, and choice outcomes (9-17). 
It supports flexible decision-making in two 
ways: (i) by representing task sets (12-4) and 
context (15), and (ii) by selectively engaging 
memory retrieval through functional inter- 
actions with other brain areas (16-18), specif- 
ically the hippocampus (19-27) and amygdala 
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(22, 23). A mechanism that facilitates such in- 
terareal interactions is phase-locking of MFC 
activity to osciollations in the hippocampus or 
amygdala. This mechanism has been exten- 
sively investigated in rodents during spatial 
behavior (24-26) and fear conditioning (27, 28), 
but its broader function remains poorly under- 
stood (29), particularly in humans. Similarly, 
human neuroimaging studies indicate that the 
MFC is involved in memory search (8, 18, 30-34) 
and that patterns and level of activity and con- 
nectivity assessed by functional magnetic re- 
sonance imaging (fMRI) vary as a function of 
retrieval intentionality (35-38). It is not yet 
known what features of decisions and context 
are represented in the human MFC, whether 
memory retrieval selectively engages synchrony 
between the MFC and the hippocampus and/or 
amygdala, and whether synchrony can be 
engaged dynamically when required. This lack 
of knowledge stands in stark contrast to the 
patent behavioral ability of humans to flexibly 
recruit memory processes in everyday life (39, 40) 
and to our detailed knowledge of memory rep- 
resentations in the human hippocampus and 
amygdala, where cells represent aspects of 
declarative memories, such as the familiarity 
and the identity of a stimulus (47-43). To ad- 
dress these open questions, we used simulta- 
neous recordings of single neurons and local 
field potentials (LFPs) in the human MFC, hip- 
pocampus, and amygdala. 


Task and behavior 


We recorded from 1430 single neurons across 
four brain areas (Fig. 1, C and D; see table S1; 
33 sessions in 13 subjects): 2 = 203, 460, 329, 
and 438 neurons from anterior hippocampus 
(HF), amygdala (AMY), dorsal anterior cingu- 
late cortex (dACC), and pre-supplementary 
motor area (pre-SMA), respectively. For brevity, 


26 June 2020 


we refer to HF and AMY together as HA (n = 
663 neurons) and to dACC and pre-SMA to- 
gether as MFC (nv = 767 neurons). 

Human subjects viewed a sequence of 
320 images, grouped into eight blocks of 
40 images each, in each session (Fig. 1, A and 
B). At the beginning of each block, subjects were 
instructed which decision to make and which 
response modality to use to communicate their 
decision. Subjects made a “yes” or “no” decision 
for each trial to indicate whether an image be- 
longed to a given visual category (“categorization 
task”) or whether an image had been seen before 
in the task or not (“memory task”). No feedback 
was provided (see Materials and methods sec- 
tion for details on the task). Each image shown 
belonged to one of four visual categories: hu- 
man faces, monkey faces, fruits, or cars. In each 
block, half of the images shown were repeated 
and half were novel (except in the first block, 
in which all images were novel). 

Subjects indicated choices using either sac- 
cades (leftward or rightward eye movement) 
or button press while maintaining fixation 
at the center of the screen (Fig. 1, E and F; 
mean + SD, 94 + 15% of all gaze positions fell 
within the image shown). Reaction times 
(RTs) were significantly longer in the mem- 
ory task than in the categorization task 
[Fig. 1G, mean RT of 1.48 + 1.1s versus 1.19 + 
1.2 s, respectively, P < 1 x 10°”°, two-sample 
Kolmogorov-Smirnov (KS) test, mean + SD 
across all trials in a given task]. Subjects 
performed with an average accuracy of 97 + 
6% versus 71 + 6% in the categorization and 
memory tasks, respectively (mean + SD across 
n = 33 sessions). This difference in accuracy 
remained after we matched for RT between 
the two tasks (96 + 6% versus 72 + 8% with 
matched RTs of 1.23 + 0.60 s versus 1.24: + 0.60 s 
for the categorization and memory task, respec- 
tively). Even without RT matching, the initial 
response in terms of arousal was not different 
between tasks, as assessed by pupillometry 
(fig. S1, J to L). In the memory task, accuracy 
increased as a function of how many times an 
image had been shown (Fig. 1H, Bappearances = 
0.56, P < 1 x 10°, mixed effects logistic 
regression; also see fig. S1, C and D, for effect 
of target versus nontarget on memory per- 
formance). Subjects had shorter RTs on “yes” 
(seen before) decisions than on “no” (novel 
stimulus) decisions in the memory task (fig. 
S1A, see legend for statistics), as expected from 
a medial temporal lobe (MTL)-dependent re- 
cognition memory task (47). In the categori- 
zation task, RT was not significantly different 
between the two responses (fig. SIA), showing 
the absence of oddball effects. 


Effects of task type and response modality 
in the MFC 


Instructions about the task type and response 
modality were shown at the beginning of each 
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Fig. 1. Task, electrode locations, and behavior. (A) Task structure. A 
session consisted of eight blocks of 40 trials. The task switched with each 
block (blue = categorization, red = memory), and the response modality 
switched halfway through each block (saccade or button press, randomly 
assigned at the beginning of the block). The subject was instructed about 
the task at the beginning of each block (purple arrows) and how to respond at 
the beginning and halfway points of each block (green arrows). (B) Example 
of screens shown to subjects for two example trials. (© and D) Electrode 


block (Fig. 1, A and B). Cells showed significant 
modulation of their firing rate during the 
baseline period as a function of task type 
(Fig. 2, A and B, shows an example in pre-SMA). 
At the single-neuron level, significantly more 
cells were modulated by task type in the MFC 
than in the HA: 25% of MFC cells (165/767, 82 
in dACC, 83 in pre-SMA; see fig. S2B) versus 
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Response time [s] 


12% of HA cells (79/663, 21 in HF, 58 in AMY), 
7 test of proportions, P < 1.5 x 10°. Similarly, 
at the population level, population decoding 
accuracy was significantly higher in the MFC 
than in the HA [Fig. 2C; 90% versus 70%, re- 
spectively; P < 1x 10°; true difference (Age) = 
20% versus empirical null distribution; see 
Materials and methods], a conclusion that held 
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locations. Each dot is the location of a microwire bundle in one subject. 
Coordinates are in Montreal Neurological Institute (MNI) 152 space. 

(E and F) Eye tracking data from one session from the button press (E) and 
eye movement (F) trials. (G) Reaction times as a function of task across 

all sessions (memory, w + SEM, 1.27 + 0.02 s; categorization, 0.90 + 0.02 s; 
P =7.6 x 10-8, two-sample KS test). (H) Memory performance improves 
over the course of the experiment (8 = 0.56, P = 8.42 x 10°, logistic mixed 
effects model). See fig. S1 for an extended summary of the behavior. 


regardless of the number of neurons used (fig. 
S2H). Cells also modulated their activity as a 
function of response modality during the base- 
line period (fig. S2E shows an example). As 
with task-type encoding, significantly more 
cells encoded response modality in the MFC, 
and this signal could be decoded with higher 
accuracy in the MFC than in the HA (14% versus 
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Fig. 2. Representations of task type and response modality. (A and 

B) Example pre-SMA neuron. (B) Average firing rate during the baseline 
period (-1 to 0 s relative to stimulus onset) for each block for the cell shown 
in (A). The average baseline firing rate across all blocks of the same type 
is shown. (€ and D) Population decoding of task type (C) and response 
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type switch (green arrows in Fig. 1A; transitions from categorization to 
memory and vice versa were pooled). Error bars 
with the exception of (B), where they indicate SE 
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ent trials with short reaction times was more 


accurate than decoding on long reaction time trials. Performance is shown 


modality (D). (E) Cross-condition decoding approach. The background 


color denotes the type of trials that were used to train a given decoder. 


(F) Cross-response modality decoding of task type from the baseline 


firing rate of all recorded cells. (G@) Cross-task decoding of response modality. 
(H) Decoding performance as a function of trial number relative to a task 


10% of cells; 84/593 versus 59/586 in the MFC 
and HA, respectively; 33 in ACC, 51 in pre-SMA, 
27 in HF, 32 in AMY; y’ test of proportions, 
P = 0.03; population decoding performance 
72% versus 64%, Fig. 2D; P< 0.05, Atrue = 8% 
versus empirical null distribution); this con- 
clusion held regardless of number of neurons 
used (fig. S21). 

After a task switch, contextual signals 
emerged rapidly within one to two trials in 
the new context in the MFC (Fig. 2H). This was 
not a result of ongoing poststimulus process- 
ing, because the task could still be decoded 
even if only considering the subset of task cells 
in the MFC, which did not differentiate between 
the tasks around response time (see fig. S2G). 
Task-switching costs were also reflected in the 
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separately for categorization (Cat) and memory ( 
and 7 x 1078, respectively, Wilcoxon rank sum te 
standard error in decoding accuracy across trials 


em) trials (P = 2 x 10°"! 
st). Error bars denote 
(80 trials in each of the 


four groups). See fig. S2 
effects by specific anato 


subjects’ longer reaction times shortly after a 
change in task or effector type (fig. S2A). Task- 
type representations during the baseline period 
were stronger on trials where the subject sub- 
sequently produced a fast response than on 
those where the response time was slow (Fig. 
21), indicating behavioral relevance. We also 
tested whether the two types of contextual 
signals were sufficiently robust to avoid inter- 
ference with one another, using a cross-condition 
generalization decoding analysis (44) (Fig. 2E). 
We first trained a decoder to discriminate task 
type on trials where the subject was instructed 
to reply with a button press, and then we tested 
the performance of this decoder on trials where 
the subject was instructed to use saccades 
(and vice versa). The two decoders generalized 
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for additional analyses that break down context 
mical regions. 


in the MFC but not in the HA (Fig. 2, F and G). 
For this reason, we focused on the MFC when 
conducting the analysis above. 


Cross-condition generalization of familiarity 
and image category 


Next, we examined whether the neural repre- 
sentations of image category and familiarity 
are sensitive to task demands. We assessed 
two consequences of task demands: gener- 
alization across tasks and strength of repre- 
sentations within each task. At the single-unit 
level, we examined visually selective (VS) cells 
(42), whose responses are thought to reflect 
input from high-level visual cortex, and memory- 
selective (MS) cells (42, whose response signals 
stimulus familiarity (Fig. 3, A and B, shows 
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Fig. 3. Representations of image category and familiarity (new versus 
old). (A and B) Example cells that (A) represent image category and (B) 
differentiate between new and old stimuli. (©) Decoding accuracy of image 
category from all recorded cells was significantly higher in the HA relative to 
the MFC (Atrue = 49%, P < 0.001). (D) Decoding of new versus old (ground 
truth) was similarly accurate in the HA and MFC (Atrye = 7%, P = 0.13). For 
new versus old decoding, trials with images of monkeys were excluded, 
because the recognition performance for these images was at chance (fig. 
S1B). (E) Population activity of all recorded HA (left) and MFC (right) cells, 
plotted in 3D using MDS. Individual points show the mean activity of the 
population for that specific condition. The highlighted plane contains all 
locations of state space occupied by a given task for the case of fruits 
versus faces as the binary category distinction (for illustration only; all 
analysis uses all categories). The geometry of the representation allows for 
a decoder that is trained on one task to generalize to the other task 

(see fig. S4C for example decoder hyperplanes). (F) Approach used for 

the cross-condition generalization analysis. Color indicates task (blue = 
categorization, red = memory). (Top) We trained a decoder to discriminate 
between new and old trials on categorization trials and then tested its 
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performance on new and old stimuli encountered during the memory 
condition (and vice versa). (Bottom) Similarly, a decoder that is trained to 
discriminate between image categories (in this example, faces versus fruits; 
all results include all six possible pairs) on categorization trials was tested on 
memory trials. (G@) Cross-condition generalization performance for image 
category. (H) Cross-condition generalization performance for new versus old. 
(I) Difference in cross-task generalization decoding accuracy for familiarity 
and image category between HA and MFC. Difference is computed between 
the average cross-task performances in each area (i.e., average of 
memory—>categorization and categorization—-memory). The null distribution 
for the average was estimated from the empirical null estimated by shuffling 
the labels used to train the decoders. For both variables, decoding from 

HA had significantly greater cross-task generalization performance than 
decoding from MFC (the difference in both cases is positive and outside of the 
95th percentile of the null distribution). (J) Generalization index (see 
Materials and methods) for memory (two data points on the left) and image 
category (two data points on the right). For both image category and 
familiarity, generalization across task was higher in the HA population than 
in the MFC population (see figure for statistics; A, difference). 
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examples). Of the HA cells, 40% were visually 
selective (264/663, 35 in HF and 229 in AMY) 
and 11% were memory-selective (73/663, 10 in 
HF, 63 in AMY; 24/73 were old>new selective 
and 49/73 were new>old selective; see Ma- 
terials and methods for selection model). In 
the MFC, 13% (103/767) of the cells were 
visually selective and 11% (84/767) were memory- 
selective. First, we performed single-neuron 
analysis of the selected HA cells. Visual and 
memory selectivity were present in both the 
memory and categorization blocks (figs. S3, 
D, E, H, and I). MS cell responses reflected a 
memory process: they strengthened over blocks 
as memories became stronger (fig. S3G), and 
they differed between forgotten old (false nega- 
tive, FN) and correctly identified new (true 
negative, TN) stimuli for both new>old (n = 49) 
and old>new (n = 24) preferring MS cells 
(fig. SIOF). Furthermore, these memory signals 
were behaviorally relevant: new/old decoding 
was significantly weaker in incorrect trials than 
in correct trials (fig. S10G). 

To answer the question of whether coding 
for visual and memory information general- 
izes across tasks, we took a population-level 
approach (over all single units, without se- 
lection). We used decoding performance on 
individual trials and single-neuron analysis 
(fig. S4, D and E) to assess whether the neural 
encoding of visual category and/or familiarity 
of a stimulus depended on task demands. In 
both the HA and MFC, image category could 
be decoded (Fig. 3C, 98 and 49% in the HA 
and MFC, respectively; chance level = 25%). 
Category decoding performance was signifi- 
cantly higher in the HA than the MFC (Aiue = 
49% versus empirical null distribution, P < 1 x 
10°). In the HA, the ability to decode category 
was not significantly different between the 
two tasks (Fig. 3C, 96% versus 99% in catego- 
rization and memory, respectively; Atrue = 3%, 
P = 0.25) and could be decoded above chance 
in both the HF and AMY (fig. S4, A and B). At 
the single-neuron level, HA neurons encoded 
significantly more information about category 
in the memory task (fig. S4D), an effect that 
decoders were not sensitive to because of satu- 
ration. In the MFC, decoding accuracy for image 
category was significantly higher in the memory 
task (Fig. 3C, 60% versus 36%, Ate = 24% versus 
empirical null distribution, P < 0.001). Memory 
was decodable in both the HA and MFC (Fig. 
3D, 69% versus 76%, respectively), with no sig- 
nificant difference in decoding accuracy be- 
tween the two tasks in the HA (Fig. 3D, 67% 
versus 71% in categorization and memory trials, 
respectively; Aue = 4%, P = 0.3) and signifi- 
cantly better decoding ability in the MFC in 
the memory task (Fig. 3D, 86% versus 61%; 
Atrue = 25% versus empirical null, P = 0.001). 
Single-neuron analysis confirmed the impres- 
sion from decoding that the strength of memory 
signals in the HA was not modulated by task 
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demands, whereas in the MFC it was (fig. S4E). 
In either task, at the population level, memory 
decoding in the HA was only possible in the 
amygdala (fig. S4B). The population-level de- 
coding of memory in the HA was principally a 
reflection of the signal carried by the MS cells 
(fig. SIOE) and was not due to repetition sup- 
pression of VS cells (fig. S10, B and C), because 
it was not possible to decode familiarity from 
VS cells alone (fig. S10D). 

To gain insight into the geometry of the 
population-level representations, we assessed 
whether the decoders trained to report famil- 
iarity and the category of the stimuli in one 
task would generalize to the other task (Fig. 3F 
schematizes our approach). Cross-task gener- 
alizability would indicate that familiarity and 
visual category are represented in an abstract 
format (44). First, cross-task generalization 
performance was greater in the HA than MFC 
for both image category (Fig. 3, G and I; 98% 
versus 41%, averaged across the two cross- 
condition decoding performances; Ate = 57% 
versus empirical null, P < 1 x 10°?) and 
familiarity (Fig. 3, H and I; 67% versus 55%, 
Atrue = 12% versus empirical null, P < 0.05). 
Second, to help understand the geometry of 
these neural representations, we projected the 
average HA and MFC population activity for 
all possible pairings of familiarity, image cate- 
gory, and task (eight different conditions) into 
a three dimensional (3D) state space using mul- 
tidimensional scaling. For illustration purposes, 
we show this 3D state space for the two image 
categories (fruits and faces) for which memory 
performance was the best. In the HA (Fig. 3E, 
left), the relative positions of a “new face” with 
respect to an “old face” were preserved across 
tasks (shown as differently colored planes). The 
relatively parallel location of the subspace of 
neural activity occupied by the two tasks per- 
mits cross-task generalization for both image 
category and familiarity. In contrast, in the 
MFC (Fig. 3E, right), the relative positions of 
the four conditions were not preserved. This is 
consistent with the weaker cross-task gener- 
alization performance observed in the MFC 
relative to the HA (Fig. 3, G and H), resulting 
in reduced generalization indices in the MFC 
compared with the HA (Fig. 3J; this metric 
takes into account different levels of within- 
task decoding accuracy, which is an upper bound 
for the achievable generalization performance; 
see the supplementary text section of the sup- 
plementary materials for details). 


Representation of choice 


We next investigated how the subject’s choice 
(yes or no) is represented by single neurons 
(Fig. 4A shows examples) and the population. 
Decoding accuracy for choices was highest 
in the MFC, with an average population de- 
coding performance of 89% compared with 
68% in the HA (Fig. 4B; Aime = 19% versus 
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empirical null, P < 1 x 107?; 61% in AMY and 
57% in HF when trained separately; fig. S7G 
shows this result as a function of number of 
neurons used). Further single-neuron (fig. S11, 
Aand B) and population (Fig. 4E and fig. S11C) 
analysis confirmed that the choice signal was 
significantly stronger in the MFC regardless of 
selection threshold and quantification method. 
We therefore first analyzed choice information 
in the MFC (see below for results in the HA). 
Choice decoding in the MFC was strongest 
shortly after stimulus onset, well before the 
response was made (fig. S7A). To disassociate 
representation of choice (yes or no) from the 
representation of ground truth (old or new) 
during the memory recognition task, we fit a 
choice decoder to a subset of trials, half of which 
were correct and half of which were incorrect. 
The activity of MFC cells predicted choice but 
not the ground truth at levels significantly dif- 
ferent from chance (Fig. 4C; choice decoding 
is above the 95th percentile of the null distribu- 
tion, whereas new/old decoding is not; see fig. 
S7D for this analysis shown over time). Choice 
could be decoded from the MFC separately for 
both the correct and the incorrect trials (fig. S51). 
As acontrol for potential confounds due to RT 
differences between tasks (see fig. SIA), we 
acquired data from a separate control task in 
which we eliminated RT differences behav- 
iorally by adding a waiting period (six sessions 
in five subjects; 2 = 180 and 162 neurons in the 
HA and MFC, respectively; see Materials and 
methods and fig. S6). As in the original task, 
MFC cells represented the subject’s choice (fig. 
S6, C to G), thereby confirming that this sep- 
aration is not due to RT differences. 

We used multidimensional scaling to visual- 
ize the population activity for the eight com- 
binations of choices, task types, and response 
modality (Fig. 4D, and see Materials and me- 
thods). The resulting geometrical configura- 
tion indicates that choice decoding generalizes 
across response modality (Fig. 4D, top) but not 
across task types (Fig. 4D, bottom). We there- 
fore computed the cross-task generalization 
performance of a decoder trained on choices 
during one task and tested on the other. We 
performed this analysis across time (Fig. 4E; 
see fig. S7B for this analysis shown separately 
for pre-SMA and dACC) and also in a single 
poststimulus time bin (Fig. 4F). To avoid con- 
founds due to response time differences, we 
performed the fixed window analysis (Fig. 4F) 
only for the control task, where the timing 
between tasks was identical (fig. S6B). While 
the choice signal did not generalize across 
task types (Fig. 4F), it did generalize across 
response modality within the same task type 
(Fig. 4G). Quantifying this observation with 
the generalization index confirmed this im- 
pression (Fig. 4H). We also examined choice 
signals in the HA. Although choice signals 
were comparatively weak in the HA (fig. S7E 
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and Fig. 4B; see Fig. 4B for statistics), they 
nevertheless exhibited a pattern of general- 
ization similar to that seen in the MFC (fig. 
S7, E and F). 

To test the possibility that any task might 
yield a distinctive choice axis that does not 
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generalize to any other task, we considered the 
four subtasks that make up the categorization 
trials in a given session (the target category 
can be any one of the four possible image cate- 
gories). We tested whether the choice signal 
generalizes across these subtasks by training 


and testing across blocks requiring different 
categorizations. Choice decoding generalized 
across all subtasks in the categorization task 
but not the memory task (Fig. 41). Next, we 
compared the dynamics of the population ac- 
tivity between the eight conditions arising 
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Fig. 4. Task-specific representation of choice. (A) Example MFC choice 
cells, split by choice (yes or no) and task. (B) Population choice decoding 
accuracy was significantly greater in MFC than in HA (across all trials, 

Atrue = 19% versus empirical null, P < 1 x 10°). (C) MFC cells represent 
choice and not the ground truth (i.e., new or old; memory trials only). 

(D) Population summary (neural state space) of choice-related activity in 
MFC, plotted in 3D space derived using MDS. (Top) Variability due to 
response modality. The highlighted planes connect the points of state space 
occupied by activity when using button presses (purple) or saccades (green). 
(Bottom) Variability due to task type. The highlighted planes connect the 
points of state space occupied by activity in the same task. (E) Choice- 
decoders trained in one task do not generalize to the other task (bin size: 
500 ms, step size: 16 ms). (F) Same as (E), but for a fixed 1-s time 
window starting at 0.2 s after stimulus onset. (G) Choice decoding 
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decoding (see Materials and methods) to summarize (F) and (G). The 
representation of choices generalized across response modality but not task. 
(1) Generalization between different subtasks of the categorization task 

but not between task types. The colored bars indicate the 5th to 95th 
percentile of the null distribution. (J) (Left) State-space trajectories for the 
four conditions arising from the combination of response (yes or no) and 
task (categorization or memory). (Right) Trajectory similarity, computed in 
an 8D latent space (recovered using GPFA, see Materials and methods) 
across the eight conditions arising from the combinations of choice, effector 
type, and task. (K) Decoder weight of each cell for decoding choice during 
the categorization and memory task. The cells in the top 25th percentile 
are shown in black. The inset shows the angle created by the vector 
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from the combination of choice, effector type, 
and task in a state-space model recovered 
using Gaussian process factor analysis (GPFA) 
[(45), and see Materials and methods]. Com- 
paring the pairwise similarity between the 
trajectories in state space (Fig. 4J, left) within 
the first 500 ms after the stimulus onset re- 
vealed that the patterns of dynamics in state 
space first cluster by task type (Fig. 4J, right; 
see movie S1). 

We next examined whether the population- 
level analysis relied on different sets of neurons 
to decode choice in each of the two tasks. We 
determined how individual cells are recruited 
by a linear decoder (44, 46). For each cell, we 
quantified its importance (46) for both the 
memory and categorization choice decoder 
(Fig. 4K). We then plotted the degree of special- 
ization for each cell on the basis of its impor- 
tance in each task (see Materials and methods). 
Cells that report choice independently of task 
should lie on the diagonal (i.e., an angle of 1/4). 
Instead, the distribution of angles was signif- 
icantly bimodal across all cells (Fig. 4K, inset 
plot; P< 1x 10°, Hartigan dip test), with modes 
centered away from the diagonal. Despite this 
bimodality, we could still use the cells that are 
the most “useful” in one task to train a new de- 
coder that can predict choice well above chance 
(although significantly weaker compared with 
using all cells) in the other task (fig. S7C). This is 
not an example of cross-task generalization, 
because we are fitting a new decoder. 


Task-dependent spike-field coherence between 
MFC cells and HA LFP 


It is thought that the selective routing of 
decision-related information between the HA 
and MEC is coordinated by interareal spike- 
field coherence (24). We therefore asked whether 
MFC neurons phase-lock to ongoing oscillations 
in the LFP in the HA and, if so, whether the 
strength of such interactions is modulated by 
task demands. We performed this analysis for 
the 13 subjects and 33 sessions for which we 
simultaneously recorded from both areas 
(Fig. 5A). In the following, we only used neural 
activity from the 1-s baseline period that pre- 
cedes stimulus onset, to avoid confounds related 
to stimulus onset-evoked activity. Individual 
cells in the MFC showed strong task modula- 
tion of MFC to HA spike-field coherence (Fig. 
5B shows a single-cell example in the dACC). 
At the population level, MFC cells showed sig- 
nificantly stronger theta-band coherence with 
HA oscillations during the memory task than 
during the categorization task (Fig. 5C, 8822 cell 
electrode pairs; P < 13 x 107’, paired ¢ test, 
measured at 5.5 Hz; see fig. S9, B and C, for 
additional controls). This was the case for both 
MFC-hippocampus and MFC-amygdala inter- 
actions (Fig. 5D, n = 3939 and 4884 and P = 
8.8 x 10°* and 4.3 x 10°°, respectively, paired 
t test). This effect was due to changes in phase 
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preference, as there was no significant differ- 
ence in HA LFP power between the tasks (Fig. 
5E, P = 0.08, paired ¢ test of signal power at 
5.5 Hz, estimate across all 8822 cell-electrode 
pairs). Of the 767 MFC cells, a significant sub- 
set of ~100 cells were phase-locked to the 
theta-band HA LFP (fig. S9A), with the largest 
proportion preferring 3 to 8 Hz. 

To determine whether there is a relation- 
ship between the tuning of cells in MFC and 
their interarea coherence with HA, we selected 
for choice cells independently in the categori- 
zation and memory task (see Materials and 
methods for selection model; note selection 
controls for RT differences). This approach 
revealed that 101/767 and 82/767 cells were 
significantly modulated by choice during the 
memory and categorization task, respectively 
(P < 0.001 versus chance for both; see Fig. 4A, 
cells 2 and 1, respectively, as an example). 
Single-neuron decoding showed that it was 
not possible to decode the subject’s choice 
in the categorization task from choice cells 
selected in the memory task and vice versa 
(fig. S5, A to D). Single-neuron analysis re- 
vealed that cells preferring either “no” or “yes” 
choices were present in approximately equal 
proportions in both tasks (fig. S5B). The re- 
moval of the selected choice cells from a pop- 
ulation decoding analysis with access to all 
recorded neurons significantly diminished de- 
coding performance (fig. S5, F and G). Notably, 
each of the selected cells had a high impor- 
tance index, as determined from population 
decoding (Fig. 5F). Considering only the MFC 
choice cells revealed that this subset of cells 
similarly increased their phase-locking dur- 
ing the memory task (Fig. 5G, top), with the 
strongest effect, again, in the theta range [peak 
frequency (fpeak) = 5 Hz, P= 1x 10°, paired t 
test]. Both categorization and memory choice 
cells showed this pattern of modulation (Fig. 
5G, bottom). The memory choice cells exhib- 
ited an increase in gamma-band coherence 
(Fig. 5G, foeax = 38.5 Hz, P = 2 x 10°, paired 
t test). The extent of phase-locking of choice 
cells following stimulus onset (0.2 to 1.2 s) 
during the memory task was significantly 
stronger for correctly retrieved trials than 
for forgotten old trials, indicating behavioral 
relevance for memory retrieval (Fig. 5H). Lastly, 
to exclude the possibility that this interarea 
effect was due to task-dependent changes within 
the HA, we examined the phase-locking proper- 
ties of HA cells to their own locally recorded 
LFP (LFP and spiking activity is recorded on 
separate electrodes; see Materials and meth- 
ods). The spiking activity of 331/663 HA cells 
was significantly related to the theta-frequency 
band LFP (Fig. 5I, shown for f = 5.5 Hz). The 
strength of this local spike-field coherence was, 
however, not significantly different between 
the two tasks (Fig. 5J, P = 0.61, paired t test, 2 = 
2321 cell-electrode pairs). 
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Discussion 

We investigated the nature of flexible decision- 
making in the human brain by probing how 
the strength and/or geometry (44) of neu- 
ral representations of stimulus memory, 
stimulus category, and choice is modified when 
subjects switch between a memory and a 
categorization task. We found evidence for 
both kinds of neural representation changes 
resulting from changing task demands for a 
subset of the studied variables. In the MFC, 
both the strength and geometry of represen- 
tation of stimulus memory changed as a func- 
tion of task demands. In contrast, in the HA, 
both the strength and geometry of the repre- 
sentation of stimulus memory were insensi- 
tive to task demands (Fig. 3, D and H). Our 
finding of memory signals in the amygdala 
supports the hypothesis (47, 48) that the 
amygdala contributes to recognition memory 
by signaling stimulus familiarity. Representa- 
tion strength of stimulus category in both the 
HA and MFC was stronger in the memory 
task, but the geometry of this representation 
was also modulated by the task in the MFC 
(Fig. 3G, right). Overall, these results show 
that the geometry of the representations (as 
assessed by cross-task generalization) of stim- 
ulus familiarity and memory were significantly 
less sensitive to task demands in the HA com- 
pared with the MFC (Fig. 3, G and H). 

At the population level in the MFC, choices 
in both the memory and categorization task 
were decodable with high reliability, but these 
decoders did not generalize across the two 
tasks. Choice decoding generalized across sub- 
tasks of the categorization task and across 
changes of response modality in both tasks, 
indicating that changes in representations were 
due to switching between a task that requires 
memory retrieval and one that does not. While 
the choice signal was significantly weaker in 
the HA, this same pattern of generalization 
also held for the HA, suggesting the pos- 
sibility that the task demand-dependent choice 
representation we found in the MFC is widely 
distributed in the brain. A group of task demand- 
dependent cells in the MFC were choice cells, 
which preferentially signal behavioral decisions 
for either memory or categorization decisions 
irrespective of response modality and regard- 
less of the ground truth of the decision. Thus, 
from the point of view of downstream areas, 
neurons formed two separate decision axes: 
one for memory-based decisions and one for 
categorization-based decisions. These two de- 
cision axes were instantiated selectively so that 
they were only present when required by the 
current task. 

These findings contrast with prior work on 
task switching involving different tasks that 
required purely perceptual decisions, which 
found a single decision axis in the monkey 
prefrontal cortex, with task-irrelevant attributes 
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also represented (3). We found that memory- 
based choices add a second decision axis, which 
is present only when decisions engage memory 
retrieval processes. Although task-sensitive re- 
presentations of choice have been shown in 


recordings from rodents and nonhuman primates 
during perceptual decision-making (2, 3, 49, 50) 
and in human neuroimaging (5/7), our data 
reveal choice representations that specifically 
signal recognition memory-based choices at the 


single-cell level. It has long been appreciated 
that the frontal lobes are critical for initiating 
and controlling memory retrieval (30, 52-54). 
Neuroimaging reveals that patterns of activity in 
some frontal and parietal areas are modulated 
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Fig. 5. Modulation of interareal spike-field coherence by task demands. 
(A) Analysis approach. Inset shows that only data from the baseline was used 
[except in panel (H)]. (B) Spike-field coherence for a cell in dACC relative 
to all channels in the ipsilateral hippocampus. (C) Phase-locking of MFC 
cells to HA LFP. (Top) Average interarea PPC of all cell-electrode pairs for 
each task. (Bottom) Significance of difference between tasks; peak difference 
was at f = 5.5 Hz. Dashed line shows the threshold (P = 0.05/56, Bonferroni 
corrected). (D) Difference in average interarea PPC at f = 5.5 Hz between 
task conditions for all possible cell-electrode pairs (from left to right, 

n = 8822 electrode pairs, P = 1.3 x 10°77; n = 3938, P= 8.8 x 10* n= 4884, 
P = 4.3 x 10°: paired t test). (E) Average spike-triggered power was 

not significantly different between the two tasks (paired t test, n = 8822 cell 
electrode pairs, P = 0.08). (F) Single-neuron analysis of choice cells. 
Importance index assigned by the decoder to each cell for decoding choices 
in either task. Selected choice cells are indicated in color. (@) MFC-HA 
spike-field coherence for choice cells. (Top) Average PPC for all choice 
cells in MFC (209 cells, 2384 cell-electrode pairs) for the two tasks. 
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(Bottom) Significance of difference between tasks, shown separately for 
memory and categorization choice cells (n = 1176 and 906 cell-electrode 
pairs, respectively). Only memory choice cells show a significant difference in 
the gamma band (P = 2 x 10°, t test). (H) Difference in spike-field coherence 
between true-positive (correct retrieval) and false negative (incorrect 
retrieval) trials measured in the [0.2 s, 1.2 s] window after the stimulus onset, 
shown separately for memory choice cells (left panel) and categorization 
choice cells (right panel) in the theta-frequency (4 to 10 Hz) and gamma- 
frequency (30 to 80 Hz) bands. PPC was significantly stronger in correctly 
retrieved trials in the theta band for memory choice cells (Amem = 0.003, 
P= 0.002; Acat = -0.001, P = 0.3; paired t test) and in the gamma band for both 
types of choice cells (Amem = 0.002, P = 0.005; Acat = 0.004, P = 7.2 x 10°. 
paired t test). (1) Spike times of HA cells are coherent with local theta-band 
(3 to 8 Hz) LFP. (J) Average local PPC in the HA did not differ significantly as a 
function of the task (f = 5.5 Hz; P = 0.51, paired t test, n = 2321 cell-electrode 
pairs). Error bars in panels (D) and (H) denote the 95% confidence interval 
(bootstrap, n = 10,000 iterations). All other error bars are SEM. 
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by memory retrieval demands (34, 35, 37, 38, 55), 
whereas memoty-related activity patterns in 
the MTL are comparatively insensitive to re- 
trieval demands (34). These findings have led 
to the proposal that the memory retrieval net- 
work consists of specialized processes separate 
from those used for other kinds of decisions 
(6, 56, 57). The memory-choice axis we de- 
scribe is a potential cellular substrate for this 
critical aspect of human cognitive flexibility. 
Future work is needed to investigate whether 
similar principles also apply to hippocampus- 
dependent associate or source memory-based 
(58-60) decisions, which we did not assess here 
(we probed recognition memory). 

A second group of cells that we characterized 
in the MFC signal the currently relevant goal 
(task type and response modality) throughout 
the task. These cells switched their activity pattern 
when instructions indicated a change in task 
demands. Although these switches were rapid, 
they were not instantaneous, likely reflecting 
the cost of switching between memory retrieval 
and categorization modes (67-63). We hypothe- 
size that these cells facilitate the holding of 
the active task set in working memory (64, 65) 
and configure brain networks in preparation 
for appropriate execution of the instructed 
task (12, 34, 66, 67). Task-switching costs are 
a much investigated aspect of cognitive flexi- 
bility (39, 61-63), but how they arise and why 
some task switches are more difficult than 
others remain poorly understood. The MFC 
cells we describe here offer an avenue to di- 
rectly investigate these questions. 

Finally, we uncovered a possible mechanism 
by which memory-based information can be 
routed dynamically between the MFC and HA 
when a task requires memory retrieval. Chang- 
ing long-range synchronization of neural activ- 
ity is thought to be a way by which functional 
connectivity between brain areas can be changed 
flexibly (68-71). Here, we reveal a specific in- 
stance of this phenomenon at the cellular level in 
humans in the form of changes in the strength 
of cortico-hippocampal and cortico-amygdala 
functional connectivity. Hippocampus-medial 
prefrontal cortex (mPFC) functional connectivity 
in rodents supports spatial working memory 
(24) and is prominent during both navigation 
and rest (72-74). Similarly, amygdala-mPFC 
functional connectivity supports flexibly switch- 
ing between aversive and neutral behaviors 
depending on learned cues (75). But it is not 
known whether these pathways serve a role in 
long-term memory retrieval in humans and, 
if so, whether this retrieval can be engaged 
selectively. Similarly, in humans, MTL-PFC con- 
nectivity changes, as measured by fMRI, have 
been related to control demands over memory 
retrieval (36, 76), but it remains unclear how 
these indirect metrics relate to long-range syn- 
chronization as measured in rodents. Here, we 
show that MFC-HA connectivity is selectively 
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enhanced during the memory task, indicating 
that patterns of interareal connectivity change 
in preparation of initiating memory retrieval 
(77, 78). The extent of synchrony after stimulus 
onset is stronger when a memory is success- 
fully retrieved compared with when it is forgot- 
ten. Memory choice cells in the MFC exhibited 
enhanced gamma-frequency band coordina- 
tion of their spiking activity with the HA LFP, 
and this modulation was behaviorally relevant 
after stimulus onset, which reveals a specific 
cellular-level instance of a role for gamma 
oscillation-mediated coordination of activity 
between distant brain regions (24, 79) in human 
memory retrieval. 


Materials and methods 
Subjects 


Subjects were 13 adult patients being eval- 
uated for surgical treatment of drug-resistant 
epilepsy that provided informed consent and 
volunteered for this study (see table S1). The 
institutional review boards of Cedars-Sinai 
Medical Center and the California Institute 
of Technology approved all protocols. We ex- 
cluded potential subjects who did not have 
at least one depth electrode in medial frontal 
cortex. 


Electrophysiology 


We recorded bilaterally from the amygdala, 
hippocampus, dACC, and pre-SMA using micro- 
wires embedded in hybrid depth electrodes 
(81). From each microwire, we recorded the 
broadband 0.1 to 9000 Hz continuous extra- 
cellular signals with a sampling rate of 32 kHz 
(ATLAS system, Neuralynx Inc.). Subjects from 
which at least one well-identified single neuron 
could not be recorded were excluded. 


Spike sorting and single-neuron analysis 


The raw signal was filtered with a zero-phase 
lag filter in the 300 to 3000 Hz band, and spikes 
were detected and sorted using a semiautomated 
template-matching algorithm (82, 83). All PSTH 
diagrams were computed using a 500-ms win- 
dow with a step size of 7.8 ms. No smoothing 
was applied. 


Electrode localization (relevant for Fig. 1) 


Electrode localization was performed based 
on postoperative MRI scans. These scans were 
registered to preoperative MRI scans using 
Freesurfer’s mri_robust_register (84) to allow 
accurate and subject-specific localization. To 
summarize electrode positions and to provide 
across-study comparability, we also aligned the 
preoperative scan to the MNI152-aligned CIT168 
template brain (85) using a concatenation of 
an affine transformation followed by a sym- 
metric image normalization (SyN) diffeomor- 
phic transform (86). This procedure provided 
the MNI coordinates that are reported here for 
every recording location. The electrode loca- 
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tions shown on the Atlas Brain (Fig. 1, C and 
D) are for illustration only. Apparent localiza- 
tion outside the target area or in white matter 
is due to usage of an Atlas Brain alone. 


Eye tracking (relevant for fig. S1) 


Gaze position was monitored using an infrared- 
based eye tracker with a 500-Hz sampling rate 
(EyeLink 1000, SR Research) (87). Calibration 
was performed using the built-in nine-point 
calibration grid and was only used if validation 
resulted in a measurement error of <1 degree 
of visual angle (dva) (average validation error 
was 0.7 dva). We used the default values for the 
thresholds in the Eyelink system that determine 
fixation and saccade (eye movement) onsets. 


Task 


Each session consisted of eight blocks of 40 trials 
shown in randomized order. At the beginning 
of each block, an instruction screen told subjects 
verbally the task to be performed for the fol- 
lowing 40 trials (categorization or recognition 
memory), the response modality to use (but- 
ton presses or eye movements), and which visual 
category is the target (for categorization task 
only; either human faces, monkey faces, fruits, 
or cars; order was pseudorandom so that each 
image type was selected as the target at least 
once) (see Fig. 1). The task to solve was either 
“Have you seen this image before, yes or no?” or 
“Does this image belong to the target category, 
yes or no.” Odd-numbered blocks (1, 3, 5, and 7) 
were categorization blocks; even numbered blocks 
were memory blocks (2, 4, 6, and 8). Button 
presses (yes or no) were recorded using a re- 
sponse box (RB-844, Cedrus Inc.). Eye move- 
ments to the left or right of the image served 
as responses in the eye movement modality 
(left = yes, right = no). The mapping between 
button and screen side and yes/no responses 
was fixed and did not change; “yes” was on the 
left and “no” was on the right. Subjects were 
reminded that left = yes, and right = no, at the 
beginning of each of the eight blocks. In the 
first block, all images were novel (40 unique 
images). In all subsequent blocks, 20 new im- 
ages were shown randomly intermixed with 
20 repeated images (the “old set”). The 20 repeated 
images remained the same throughout a ses- 
sion. We used entirely nonoverlapping image 
sets for patients that completed multiple ses- 
sions. The response modality (button presses 
or eye movements) was initially selected ran- 
domly and then was switched in the middle 
of each block (an instruction screen in the 
middle of each block showed the response 
modality to be used for the remainder of the 
block). In sessions where eye tracking was not 
possible because of problems with calibration 
(five sessions in three patients; see table S1), all 
trials used the button presses as the response 
modality. No trial-by-trial feedback was given. 
In between image presentations, subjects were 
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instructed to look at the fixation cross in the 
center of the screen. 


Control task (relevant for fig. S6) 


In 5 of the 13 subjects in this dataset (6/33 
sessions), we ran an additional control task 
in order to help determine whether neural 
responses reflected processing of stimuli, of 
decision variables, or of motor response plans. 
Unlike the standard task where the subjects 
could respond at any time after the stimulus 
onset (thus making it difficult to distinguish 
decision from choice), in this control task the 
subjects were instructed to wait until the re- 
sponse cue in order to register their answer, 
either with a button press or with a sac- 
cade. The stimulus was presented for a fixed 
amount of time (1-s duration) and after a 
0.5-to-1.5-s delay period, the subjects were 
asked to respond to the question relevant for 
that block. 


Mixed-effects modeling of behavior (relevant for 
Fig. 1 and fig. S1) 


For the group analysis of behavior, we used 
mixed-effects models of the form y = XB + Zb + 
e, where y is the response, X is the fixed-effects 
design matrix, B is the fixed-effects coeffi- 
cients, Z is the random-effects design matrix, 
bis the random-effects coefficient, and € is the 
error vector. In all analyses, we used a random 
intercept model with a fixed slope. The group- 
ing variable for the random effects was the 
session ID. The reported P values in the main 
text correspond to the fixed-intercept for the 
relevant variable. In the case of measuring the 
effect of number of expositions (i.e., number 
of times an image was seen) on the subject’s 
accuracy during the memory trials, we used 
a mixed-effects logistic regression with the in- 
dependent variable as an ordinal-valued whole 
number ranging from 1 to 7. The response was 
a logical value indicating success or failure on 
each memory question. Prior to running any 
analysis of reaction time data, we excluded out- 
liers from the distribution using the following 
procedure: A sample was considered an outlier 
if it was outside the 99th percentile of the em- 
pirical distribution. 


Reaction time matching procedure 


As a control, we matched for RTs between the 
two tasks (categorization and memory) to ex- 
clude for potential differences due to difficulty. 
To achieve this, we first added noise to all re- 
action times (SD = 1 ms), followed by locating 
pairs of trials with RTs that were equal to 
within a tolerance of 0.1 s. Matching pairs 
were then removed, and this procedure was 
repeated iteratively until no further matches 
could be found. Unmatched trials were ex- 
cluded (resulting in reduced statistical power 
owing to fewer available trials). We only used 
the resulting match if the RTs between the 
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two groups were not significantly different. If 
not, the procedure above was repeated. 


Selection of visually (VS) and memory-selective 
(MS) cells (relevant for fig. S3) 


A cell was considered a VS cell if its response 
covaried significantly with visual category, 
as assessed using a 1x4 analysis of variance 
(ANOVA) test at P < 0.05. For each selected 
cell, the preferred image category was set to 
be the image category for which the mean 
firing rate of the cell was the greatest. All trials 
were used for this analysis. MS cells were 
selected using the following linear model 


frcen~1 + By + category —F Bo. new/old + By ert 


where “category” is a categorical (1x4) va- 
riable, “new/old” is a binary variable, and 
“rt” is a continuously valued variable. A cell 
was determined to be memory-selective if 
the t-statistic for B2 was significant with P < 
0.05. We excluded the first block of trials 
(40 images) from the analysis, in order to 
keep the number of new and old stimuli the 
same. Spikes were counted for every trial in 
a 1-s window starting at 200 ms after stimulus 
onset. 


Selection of choice cells (relevant for fig. S5) 


Choice cells were selected using a regression 
model applied to the firing rate in a 1-s-size 
window starting 200 ms after stimulus onset. 
We fit the following regression model 


free~1 + By. category + By * response + By ert 


where the response is binary (yes or no), category 
is a categorical variable with four levels, and RT 
is the reaction time. We fit this model sepa- 
rately to trials in the memory and categori- 
zation condition, assuring independent selection 
of cells. RT was included as a nuisance regressor 
to control for reaction time differences between 
the two possible responses (see fig. SIA). A cell 
qualified as a choice cell if the t-statistic of the 
B. term was significant at P < 0.05 for at least 
one of the two task conditions. The response 
preference of significant cells for either yes 
or no was determined based on the sign of 
Bo (positive = yes, negative = no). Notice that 
the selection process uses separate trials for 
memory choice cells and categorization choice 
cells. All trials, regardless of whether the an- 
swer was correct or incorrect, were used for 
selection. To estimate the significance of the 
number of selected cells, we generated a null 
distribution by repeating the above selection 
process 1000 times after randomly reshuffling 
the response label. We estimated this null dis- 
tribution separately for choice cells in the memory 
and categorization condition and used each to 
estimate the significance of the number of se- 
lected cells of each type. 
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Chance levels for cell selection (relevant for figs. 
S2, S3, and S5) 

To estimate the chance levels for cell selection, 
we repeated above procedures for selection of 
visual category, memory selective, and choice 
cells after randomly scrambling the order of 
the labels determining the category member- 
ship being selected for (yes/no response, visual 
category, and new/old ground truth, respec- 
tively). We repeated this procedure 1000 times. 


Single-cell decoding (relevant for fig. S5) 


Single-cell decoding was done using a Poisson 
naive Bayes decoder. The features used were 
spike counts in a 1-s window, in the interval 
[0.2 s, 1.2 s] relative to stimulus onset. The 
decoder returns the probability of a class label, 
given the observed spike count. The class label 
was binary (“yes” or “no”). The model assumes 
that the spike count is generated by a univar- 
iate Poisson distribution, and a separate mean 
rate parameter (A) is fit to each feature-class 
pair. For a new observation, class membership 
is determined on the basis of the likelihood 
value. Notice that we used a single spike count 
as a feature, so the naive assumption of the 
decoder is no longer relevant in this case. 


Population decoding (relevant for Figs. 2, 3, and 
4 and figs. S2 to S5, S7, and S8) 


Single-trial population decoding was performed 
on a pseudo-population assembled across ses- 
sions (88). We present decoding results for a 
variety of task variables: (i) image category, 
Gi) new versus old, (iii) choice during memory 
trials, (iv) choice during categorization, (v) 
task, and (vi) response type. In order to esti- 
mate the variance of the decoding performance, 
on each iteration of the decoder (minimum of 
250 iterations), we randomly selected 75% of 
the cell population that was being analyzed. 
For example, to measure choice decoding in 
MFC (as shown in Fig. 4), we would randomly 
select 575/767 cells on each iteration of the 
decoder. The total number of available cells 
depended on the variable that was being de- 
coded. For example, for response type decoding, 
the number of cells in MFC was 593, because 
28/33 sessions included both response types. 
We matched the number of trials per condi- 
tion contributed by each cell that was selected 
to participate in the population decoding. For 
most task variables (image category, new/old, 
context, effector-type), the number of samples 
from each cell was equal, because the task 
structure remained the same across all sub- 
jects and sessions. For choice decoding, how- 
ever, the number of instances varied, because 
the subjects were free to respond with a “yes” or 
a “no” for each stimulus. We therefore matched 
the numbers for the smallest group across all 
subjects. This matching procedure can further 
reduce the number of cells we included in 
the decoding that do not have the minimum 
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number of trials needed per condition. For the 
population decoding and cross-condition gen- 
eralization of familiarity presented in Fig. 3, 
we used all image categories for which the 
subjects showed above-chance recognition per- 
formance (see fig. SIB). Therefore we used im- 
ages of cars, fruits, and faces, excluding images 
of monkeys from the analysis. For all other anal- 
ysis, we used all available categories. 

A series of preprocessing steps were carried 
out before training the decoder. Firing rates for 
each cell were first de-trended (to account for 
any drift in the baseline-firing rate) and then 
normalized (z-scored) using the mean and 
standard deviation estimated from the training 
set. We then performed 10-fold cross validation 
using a linear support vector machine (SVM) 
decoder to estimate performance, as imple- 
mented by the “fitcecoc” function in MATLAB. 
We used an SVM with a linear kernel and a 
scale of 1. Decoding results are reported either 
as a function of time or in a fixed time window. 
Time-resolved decoding was done on spike 
counts measured in a 500-ms moving window, 
with a 16-ms step size. For fixed-window decoding, 
we used spike counts in a 1-s window. The loca- 
tion of the window depended on the analysis. 
In Fig. 2, for example, we used a [-1, 0] window 
relative to stimulus onset for task type and re- 
sponse type decoding. In Fig. 3, we used spike 
counts in a [0.2, 1.2] window relative to stimulus 
onset for decoding image category and new/old. 


Null models for testing significance of 
decoding performance 


Throughout the manuscript, we compare the 
performance of our decoders against the 95th 
percentile of a null distribution. The way that 
this null distribution is generated depends 
on the variable being decoded. For variables 
such as image category, new versus old, and 
response (i.e., yes versus no), we used a simple 
shuffling procedure for the labels. For varia- 
bles such as task type, which had structure 
over time (memory blocks were always pre- 
ceded by categorization block), small drifts 
in firing rate might lead to inflated decoding 
accuracy. Therefore, for such variables, the shuf- 
fling was done in such a way as to preserve 
their temporal relationship. Specifically, we 
offset (i.e., circular shift) the labels by a random 
integer value (sampled from the range +10 to 
20 trials). In the case of task decoding from 
the baseline firing rate, this is a very conser- 
vative measure of the null decoding perform- 
ance, because many trials retain their original 
label, thereby inflating the accuracy. This also 
means that the mean performance of the null 
distribution will not be the theoretical chancel 
level. In the case of task decoding, the theo- 
retical chance level is 50% (binary classifi- 
cation). Using the circular shift method for 
scrambling labels, the mean of the null dis- 
tribution was ~60%. 
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To compare the performance between dif- 
ferent decoders, for example choice decoding 
from the HA versus MFC population, we con- 
structed an empirical null distribution from 
the pairwise differences in the performance of 
these two decoders trained using the shuffled 
labels. For example, if we get N estimates of 
the null performance (i.e., after shuffling the 
labels) of the HA decoder and N estimates of 
the null performance of the MFC decoder, 
we construct a distribution of the NeN = N? 
pairwise differences. We can then compute 
the significance of the true difference in de- 
coding performance between MFC and HA, 
Atrue, relative to this distribution. The vari- 
ance of the null distribution is sensitive to 
the number of trials available for decoding 
because it changes the resolution (step size) 
by which decoding accuracy can change. For 
example, for 10 trials, the accuracy can take 
values from 0 to 1 in increments of 0.1. This 
results in different values for the 95th per- 
centile of the null distribution and is the reason 
why in some cases a given difference in decod- 
ing accuracy is significant while it is not in 
others. Unless otherwise specified, all P values 
for comparing decoding performance between 
conditions or brain areas are calculated using 
this approach. In the one case where the num- 
ber of trials in a condition was too low to re- 
liably estimate the null distribution (fig. S51) 
and for comparing the generalization index 
(fig. 3J) we used a bootstrap test for equality 
of means (89) to compare the two conditions to 
assign a P value to the true difference (repeated 
1000 times to estimate the null distribution). 


Multidimensional scaling (relevant for figs. 3 and 4) 


Multidimensional scaling (MDS) was used 
only for visualization. We computed MDS 
using Euclidean distances (MATLAB function 
“mdscale’) on z-scored spike count data in the 
[0.2 s, 1.2 s] window relative to image onset. In 
Fig. 3E, for example, MDS was computed on 
the activity across the entire population of HA 
and MFC cells, averaged across the eight con- 
ditions plotted (new/old ® task ® image cate- 
gory, where ® denotes the Cartesian product). 
Here the image category was restricted to images 
of human faces and fruits, for visualization 
purposes. For the cross-condition generalization 
performance, we use all four image categories. 
In Fig. 4D we compute MDS on the population 
of MFC cells, averaged across eight conditions 
(response ® task ® effector, where ® denotes 
the Cartesian product). In all cases, we use MDS 
to map the neural activity to a 3D space. 


Normalized weight metric (relevant for Figs. 4 
and 5 and figs. S5, S7, S8, and S10) 


The normalized weight metric is computed 
from the weight that a decoder assigns to a 
particular cell for a given classification. This 
weight is denoted as w}, where the index i 
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denotes the cell, and the index ¢ denotes the 
condition (for example, categorization or mem- 
ory). The weight is converted into a normal- 
ized measure called an importance index, 
defined as 
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State-space analysis (relevant for Fig. 41) 


We used Gaussian-process factor analysis (GPFA) 
(45) to analyze the dynamics of the average 
population activity for the eight conditions 
arising from the combination of choice (yes, 
no), response modality (button press, saccade), 
and task (memory, categorization). The recov- 
ered latent space was eight-dimensional (8D), 
and all similarity measurements between tra- 
jectories were performed in this space (not in 
the 3D projections shown in the figure). The 
activity was binned using 20-ms windows. All 
analysis was computed and visualized using 
the DataHigh (90) MATLAB toolbox. Similarity 
measurements between two conditions were 
computed and averaged over the first 500ms 
after stimulus onset as follows 
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where 7,(f) and 7'9(Z) are the 8D state-space tra- 
jectories for conditions 1 and 2 [7(¢) and 7° (t) 
indicate the derivatives over time], respectively. 


ANOVA model (relevant for figs. S4 and S11) 


We used a single-cell ANOVA model to tease 
apart the contributions of choice, visual cate- 
gory, memory, and response time on the 
firing rate of a cell. The model was of the fol- 
lowing form 


frcen~By * category Bo . familiarity ar Bs « choice ar Ba ert 


where fr, is the mean firing rate in a fixed 
window (0.2 to 1.2 s following stimulus onset) 
or a moving window of 500 ms to analyze the 
time course. The ANOVA model is fit in- 
dependently at each point of time. We then 
compute the F-statistic for each of the regres- 
sors and report the average F-statistic across 
the entire population of recorded cells, fit twice 
to each cell for the memory and categorization 
task (figs. S4, D and E, fig. S11). To compare the 
effects of task on the representation of indi- 
vidual variables, we compare the distribution 
of F-statistics estimated separately on each task, 
for each cell in the population. We use this 
approach as a measure of modulation in the 
strength of representation for a variable in- 
duced by task switching. This comparison does 
not make predictions about generalizability 
from one task to the next, because the model 
is fit independently. 
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Generalization index (relevant for Figs. 3 and 4) 
To compare the within-condition decoding to 
the across condition generalization, we used 
a generalization index defined as following 


cross — chance 
| ara a Pee 
within — chance 


where “within” is the decoding performance 
within condition, “cross” is the decoding across 
condition, and “chance” is the chance decoding 
performance for the variable of interest (choice = 
0.5, new/old = 0.5, familiarity = 0.5, image cate- 
gory = 0.25). 


Spike-field coherence analysis (relevant for 
Fig. 5 and fig. S9) 
LFP preprocessing 


The local-field potential recordings were high- 
pass filtered at 1 Hz. The raw recordings, sam- 
pled at 32 kHz, were then downsampled to 
500 Hz. The downsampling procedure was 
done with the “resample” command in MATLAB, 
which applies the appropriate antialiasing filter 
prior to reducing the sampling rate. For each 
session, we screened all MFC and HA electro- 
des in order to make sure that there were no 
artifacts that could contaminate the spike- 
field metrics. We excluded all electrodes with 
interictal discharges (IEDs) visible in the raw 
trace (by visual inspection). Specifically, in screen- 
ing for IEDs, we looked for large stereotyped, 
recurring transients in the raw recording that 
did not correspond to cellular spiking activity. 
The presence of such transients would disqualify 
an electrode from further consideration. 


Spike-field coherence (SFC) 


All spike-field coherence analysis was performed 
on snippets of the LFP extracted around the 
spike. We extract snippets for every cell-electrode 
pair. For example, to measure interarea SFC 
between a single cell in pre-SMA and HA LFPs, 
we extracted n snippets each (nm = number of 
spikes) from each of the eight ipsilateral 
electrodes in hippocampus and eight elec- 
trodes in the ipsilateral amygdala. For ses- 
sions where we used a local reference (i.e., 
bipolar referencing), we exclude the reference 
wire. For intra-area coherence (for example, 
HA spikes to HA field) we also exclude the 
wire on which the cell was recorded to avoid 
contamination by spike waveform. For each 
snippet and for each cell-electrode pair, we 
compute the spike-triggered spectrum using 
the FieldTrip “mtmconvol” method, which com- 
putes the Fourier spectrum of the LFP around 
the spikes using convolution of the complete 
LFP traces. The spectrum was computed with 
a single “hanning” taper, at 56 logarithmically 
spaced frequencies ranging from 2 Hz on the 
low end to 125 Hz on the high end. The length 
of the snippet window was dynamic as a func- 
tion of the frequency examined; the snipped 
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length was set to equal to two cycles of the 
underlying frequency at which the spectrum 
was estimated (i.e., 2 Hz—2 s snippet). We 
estimated the phase for each snippet and for 
each of the 56 frequencies from the complex- 
valued Fourier coefficients (i.e., phasor). We 
used the pairwise phase consistency (PPC) 
metric as the measure of coherence. For the 
spike-triggered power, we compute the mag- 
nitude of the spectral coefficients returned by 
the Fourier transform (also computed for each 
cell-electrode pair) for each snippet and aver- 
aged the spectra. Unless otherwise stated, all 
SFC results in the paper are based on spikes 
recorded during the baseline period between 
trials (1-s window preceding stimulus onset). 


Group comparisons using the SFC metric 


When comparing two or more groups using 
PPC (such as memory versus categorization), 
we balanced the number of spikes between 
the two groups. To reduce bias involved in 
subsampling the larger group, we resampled 
the spikes from the two groups 200 times 
and computed the PPC metric on each itera- 
tion. The final coherence measure for a given 
cell-electrode pair was an average across all 
200 iterations. 

To ensure that the underlying local field po- 
tential does not vary in a consistent way across 
conditions, we compare the distribution of aver- 
age voltage values for each of the conditions in 
our spike-field coherence analysis. In the case 
of the task contrast during baseline (i.e., memory 
versus categorization), we show the distribution 
of area under the curve (AUC) values computed 
separately for each electrode in the amygdala 
and hippocampus (fig. S9D shows that there 
was no significant difference). The AUC for 
each electrode is computed using the average 
baseline magnitude across memory and cate- 
gorization trials. In the case of the spike-field 
coherence results during the stimulus onset 
(Fig. 5H), to reduce any potential confounds 
related to event-related potentials, we used 
only sessions with local referencing (bipolar). 
The local reference (set to one of the eight 
microwires in the electrode cluster implanted 
in each brain area) significantly diminishes 
the magnitude of any event-related potentials 
after stimulus onset. To confirm this, we re- 
peated the AUC analysis mentioned above, for 
the contrast in Fig. 5H [i.e., true positive (TP) 
versus false negative (FN)]. The results (shown 
in fig. S9E) show that there is no significant dif- 
ference between the two conditions of interest. 
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CHROMATIN SEQUENCING 


Single-molecule regulatory architectures captured 
by chromatin fiber sequencing 


Andrew B. Stergachis, Brian M. Debo*, Eric Haugen’, L. Stirling Churchman?, 


John A. Stamatoyannopoulos*** 


Gene regulation is chiefly determined at the level of individual linear chromatin molecules, yet our current 
understanding of cis-regulatory architectures derives from fragmented sampling of large numbers of 
disparate molecules. We developed an approach for precisely stenciling the structure of individual 
chromatin fibers onto their composite DNA templates using nonspecific DNA N°-adenine 
methyltransferases. Single-molecule long-read sequencing of chromatin stencils enabled nucleotide- 
resolution readout of the primary architecture of multikilobase chromatin fibers (Fiber-seq). Fiber-seq 
exposed widespread plasticity in the linear organization of individual chromatin fibers and illuminated 
principles guiding regulatory DNA actuation, the coordinated actuation of neighboring regulatory 
elements, single-molecule nucleosome positioning, and single-molecule transcription factor occupancy. 
Our approach and results open new vistas on the primary architecture of gene regulation. 


he primary architecture of chromatin 
comprises nucleosome arrays punctuated 

by short regulatory regions containing 
transcription factors (TFs) and other non- 
histone proteins. This architecture is 
foundational for genome function yet remains 
undefined at the level of individual chromatin 
fibers, the fundamental units of gene regula- 
tion. For example, it is largely unknown how 
regulatory DNA is actuated on individual chro- 
matin fibers, or the degree to which nearby regu- 
latory regions are coordinately actuated on the 
same chromatin fiber. It is also unknown how 
regulatory DNA actuation affects nucleosome 
positioning on individual chromatin fibers, and 
how TF occupancy modulates regulatory DNA 
actuation and function on single templates. Ad- 
dressing these questions requires nucleotide- 
resolution analysis of individual multikilobase 
chromatin fibers, which is not obtainable with 
current single-cell or bulk profiling approaches. 
We sought to develop a method for record- 
ing the primary architecture of chromatin onto 
its underlying DNA template at single-nucleotide 
resolution, thereby enabling the simultaneous 
identification of genetic and epigenetic features 
along multikilobase segments of the genome. 
Current approaches to mapping chromatin and 
regulatory architectures sample large popu- 
lations of chromatin fibers and rely on dissolu- 
tion of chromatin using nucleases such as 
deoxyribonuclease (DNase I) (/, 2), micrococcal 
nuclease (3, 4), restriction enzymes (5), trans- 
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posases (6), or mechanical shearing (7). CpG and 
GpC methyltransferases are capable of marking 
accessible cytosines in a dinucleotide context 
without digesting DNA (8-10), and approaches 
using GpC methyltransferases have been extended 
to single-pass nanopore sequencing (I, 12). How- 
ever, the utility of these approaches for gaining 
insights into the biology of individual chromatin 
fibers is limited because of (i) the sporadic oc- 
currence and linear clustering of CpG and GpC 
dinucleotides in animal genomes as a result 
of mutation and selection (fig. $1); (ii) the con- 
founding influence of endogenous cytosine 
methylation machineries (73); (iii) the marked 
DNA degradation induced by bisulfite conver- 
sion (14); and (iv) the intrinsically limited ability 
of nanopore sequencing to accurately identify 
modified bases on a per-molecule basis (7, 12, 15). 

Unlike cytosine, adenine bases in DNA are 
almost completely devoid of endogenous 
methylation in eukaryotes (16) and occur at an 
average frequency approaching one in every 
two DNA base pairs in animal genomes 
without the clustering and extended deserts 
characteristic of guanine-cytosine dinucleo- 
tides (fig. S1). The sequence-specific adenine 
methyltransferase Dam demonstrates some 
preference for nonchromatinized DNA (77-19), 
suggesting that nonspecific members of this 
class of enzymes could have similar properties. 
We therefore sought to identify a nonspecific 
(that is, non-sequence context dependent) N°- 
adenine DNA methyltransferase (m6A-MTase) 
with high efficiency, high stability, and a molecular 
weight similar to that of nonspecific nucleases 
such as DNase I (~30 kD) that are able to access 
protein-DNA interfaces at nucleotide resolution 
(Fig. 1A). We isolated five distinct nonspecific 
DNA m6A-MTases (20-23) and demonstrated 
that treatment of nonchromatinized (fig. $2) 
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and chromatinized DNA templates (Fig. 1B) with 
increasing amounts of each enzyme resulted in 
monotonic increases in adenine methylation. 


m6A-MTases are quantitative probes of 
DNA accessibility 


To establish selectivity of nonspecific m6A- 
MTases for accessible DNA templates in nuclear 
chromatin, we treated Drosophila melanogaster 
82 cell nuclei with increasing concentrations 
of five adenine methyltransferases and com- 
pared the genomic distribution of m6A-DNA 
with the distribution of DNase I cleavages, 
defined using DNase I-seq [the established 
standard for marking accessible DNA templates 
d, 2)] (Fig. 1C). Immunoprecipitation and se- 
quencing of short [median, 110 base pairs (bp)] 
DNA fragments harboring m6A (AdMTase-seq) 
showed the genomic distribution of m6A-DNA 
to mirror the density of DNase I cleavages 
quantified by DNase I-seq (Fig. 1D). m6A-MTases 
showed high selectivity for accessible DNA 
and quantitative representation of DNase 
I-hypersensitive sites (DHSs) (Fig. 1D and fig. 
$3). Further, m6A-MTases demonstrated de- 
creasing selectivity toward DHSs with increas- 
ing amounts of enzyme (Fig. 1E and fig. $3, A 
and B), analogous with the enzymatic action of 
DNase I (or micrococcal nuclease) on chromatin 
substrates due to increasing marking of the 
far more numerous internucleosomal linker 
regions (24-26). Quantification of DNA acces- 
sibility by AdMTase-seq was highly reproducible 
at both promoter-proximal and promoter-distal 
regulatory elements (Fig. IF and fig. S3C), with 
the enzyme Hia5 demonstrating the highest 
efficiency (figs. S2 and S3). These results show 
that nonspecific m6A-MTases provide quanti- 
tative probes of DNA accessibility in chromatin. 


m6A stencils reveal single-molecule 
chromatin architectures 


We next aimed to reconstruct the primary 
architecture of chromatin fibers at nucleotide 
resolution by sequencing the linear pattern of 
mé6a along multikilobase chromatin stencils; 
we termed this process Fiber-seq (Fig. 2A). To 
implement Fiber-seq, we capitalized on the 
ability of single-molecule DNA sequencers to 
discriminate methylated from unmethylated 
adenine residues based on the DNA polymerase 
kinetics at that base during sequencing (23, 27). 
To achieve nucleotide resolution, we leveraged 
single-molecule circular consensus sequencing 
(CCS) (27, 28) performed on a Pacific Biosciences 
(PacBio) instrument, which enables resequenc- 
ing of each chromatin fiber stencil at least 10 
times, resulting in highly accurate base calling of 
both modified and unmodified nucleotides. 

To create chromatin stencils, we performed 
m6A-MTase treatment of S2 cell nuclei, followed 
by polymerase chain reaction-free library 
construction on high-molecular-weight DNA 
extracted from either treated or untreated nuclei. 
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The resulting libraries were subjected to single- 
molecule CCS, providing very high base-calling 
accuracy (fig. S4, A and B) across chromatin 
fibers of up to ~30 kb in length (average fiber 
length, 10.9 kb) (Fig. 2B). Whereas untreated 
nuclei showed minimal m6A signal, nearly all 
fibers from m6A-MTase-treated cells demon- 
strated a high degree of m6A methylation 
(Fig. 2, C to E, and fig. S4A). The m6A-MTase- 
treated sample yielded an average coverage 
of 43 fibers for each DHS, with the average 
number of m6A-marked bases on each fiber 
overlapping a DHS mirroring the density of 
DNase I cleavages quantified by DNase I-seq 
from bulk nuclei (Fig. 2F). We observed marked 
clustering of adenine methylations into short 
contiguous regions spanning tens to hundreds 
of nucleotides, flanked by extended stretches of 
unmodified nucleotides (Fig. 2, D and E). Two cat- 
egories of methylase-accessible DNA sequences 


A 


Cleavage-based assay: 
Accessible DNA 


Methyltransferase-based assay: 
Accessible DNA 


(MADs) were evident: (i) sequence elements 
with an average length of 272 bp that coincided 
with DHSs and (ii) far more numerous shorter 
sequence elements with an average length of 
67 bp and regularized spacing, paralleling the 
expected size and distribution of internucleo- 
somal linker regions (29) (Fig. 2G and fig. S4D). 
m6A modifications were largely absent from 
nucleosome-wrapped DNA between linker re- 
gions (Fig. 2, D and E), consistent with the 
requirement of m6A-MTases for base flipping 
to modify adenines (30), which is likely sup- 
pressed by nucleosome wrapping. Notably, 
extended stretches of unmodified bases were 
absent from deproteinized chromatin fibers 
treated with m6A-MTases (fig. S5). Together, 
these results show that Fiber-seq accurately 
translates m6A single-molecule chromatin sten- 
cils into a linear, nucleotide-precise readout of 
the primary architecture of chromatin. 


ow 


$2 nuclei treated with Hia5S 
(Units) 


- Untreated 


All-or-none actuation of regulatory DNA on 
individual fibers 

We next sought to identify how regulatory 
information encoded in genomic DNA is ac- 
tuated on individual chromatin fibers. Regulatory 
DNA actuation (the all-or-none adoption of a 
nucleosome-free state with resulting hyperacces- 
sibility of the underlying DNA) is fundamental 
to cell state and fate decisions (37) as well as 
phenotypic traits and disease risk (32), yet it is 
currently unknown whether regulatory elements 
are actuated in an all-or-none fashion on any 
given chromatin fiber (in place of a canonical 
nucleosome) or whether regulatory DNA is ac- 
tuated to varying degrees on all chromatin fibers 
as alternative structures with varying DNA 
accessibility. Evaluation of individual chroma- 
tin fibers overlapping each DHS demonstrated 
that overall, only 81% of these fibers had large 
MADs consistent with an actuated or open 
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Fig. 1. Nonspecific m6A-MTases selectively mark sites of chromatin 
accessibility. (A) Schematic of cleavage- and m6A-MTase-based methods for 
marking sites of chromatin accessibility. Tn5, transposon 5. (B) Dot-blot 
quantification of m6A-modified genomic DNA (gDNA) from S2 cell nuclei after 
treatment with the m6A-MTase Hiad. (C) Experimental schematic for AdMTase-seq. 
(D and E) Genomic loci comparing the relationship between DNase I-seq signal 
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and AdMTase-seq signal after treatment of S2 cell nuclei with (D) five separate 
m6A-MTases or (E) increasing amount of the m6A-MTase Hiad. The y axis is identical 
for all AdMTase-seq and m6A-immunoprecipitation-sequencing (m6A-|P-seq) 
experiments. (F) Comparison of AdMTase-seq signal for S2 cell DHSs from cells 
treated with (top) Hia5 versus EcoGll or (bottom) bulk DNase I-seq signal. 

The y axis is identical for all AdMTase-seq and m6A-IP-seq experiments. 
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chromatin state overlapping the DHS, with the 
remaining fibers showing nucleosome demar- 
cation indicative of a closed state at the DHS (Fig. 
3, A and B, and fig. S6, A to C). Transcription 
start site (TSS)-distal DHSs were preferentially 
maintained in a closed state when compared with 
promoter DHSs (Fig. 3B), and the rate of DNA 
actuation at individual TSS-distal DHSs mir- 
rored the density of DNase I cleavages quan- 
tified by DNase I-seq from bulk nuclei (Fig. 3C). 
By contrast, wider TSS-distal DHSs, as well as 
promoter DHSs, were preferentially maintained 
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in an accessible state (Fig. 3B and fig. S6D), with 
the actuated DNA content of these DHSs demon- 
strating widespread heterogeneity across indi- 
vidual chromatin fibers (Fig. 2, E and G) because 
of the variable punctuation of co-occupying 
nucleosomes (fig. S6E). Together, these findings 
demonstrate that the regulatory DNA content at 
TSS-distal regulatory elements is predominately 
modulated by all-or-none DNA actuation, with 
the regulatory DNA content at larger promoter 
elements additionally modulated by the varia- 
ble punctuation of co-occupying nucleosomes. 
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Co-actuation of neighboring elements on 
single chromatin fibers 

Linear clustering of gene regulatory elements 
along complex genomes is well described, 
canonically in the context of locus control re- 
gions (LCRs) or “super-enhancers” (e.g., the 
beta-globin LCR) or gene-specific control re- 
gions (e.g., the BCLIIA enhancer region) 
(33-36). It is currently unknown whether the 
individual elements in such clusters are actu- 
ated coordinately or independently or, more 
generally, whether actuation of one genomic 
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Fig. 2. Fiber-seq exposes base pair-resolution maps of individual chroma- 
tin fiber architecture. (A) Experimental schematic for Fiber-seq. (B) Histogram 
of read lengths for chromatin fibers sequenced using Fiber-seq. Avg., average. 
(C) Percentage of chromatin fibers with m6A-methylated bases from DNA 
isolated from untreated and Hia5-treated S2 cell nuclei. (D and E) Genomic loci 
comparing the relationship between DNase |-seq, AdMTase-seq, and Fiber-seq. 
Individual PacBio reads (chromatin fibers) are marked with gray lines, and m6A- 
modified bases are marked in purple dashes. White regions separate individual 
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reads. Insets show (D) DHS with individual m6A-modified bases and (E) a 
comparison of multiple fibers overlapping a single DHS. (F) Scatterplot 
comparing DNase |-seq signal and the average Fiber-seq m6A signal at each DHS. 
(G) (Top) Histogram of MAD widths for all MADs identified outside of DHSs (gray) 
or in promoter DHSs (green). Box-and-whisker plots for the aforementioned in 
addition to MAD widths for all MADs identified in TSS-distal DHSs (blue). (Bottom) 
Box-and-whisker plots showing the standard deviation in MAD widths across 
multiple fibers overlapping a DHS. *P value < 0.001 (Wilcoxon test). 
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Fig. 3. Preferential co-actuation of neigh- 
boring regulatory elements on the same 
chromatin fiber. (A) Representative genomic 
locus comparing the relationship between 
DNase |-seq, AdMTase-seq, and Fiber-seq at 
neighboring DHSs exposes the predominant 
all-or-none actuation of individual DHSs. 

(B) Proportion of DHSs that overlap actuated 
versus closed fibers for DHSs divided based on 
their proximity to TSSs as well as the gene 
expression level of the TSS. Error bars represent 
95% confidence intervals. *P value < 0.001 
(z-test). RPKM, reads per kilobase per million 
reads. (C) Scatterplot showing the relationship 
between DHS actuation and the bulk DNase 
I-seq signal for TSS-distal DHSs. (D) For 
chromatin fibers encompassing two DHSs, 
shown is the percentage of fibers containing 
accessible MADs at both DHSs as a function 
of the distance between the DHSs. Error 

bars represent 95% confidence intervals. 

*P < 0.001 (z-test). 
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leosome positioning. (A) Schematic showing the calculation of nucleosome (Nucs.) positioning using 


overlapping reads as well as a box-and-whisker plot of nucleosome offsets at different genomic loci. *P value < 0.001 (Wilcoxon test). (B and C) (Top) Box-and- 

whisker plots showing nucleosome offsets neighboring (B) promoter DHSs and (C) TSS-distal DHSs. (Bottom) Bar plots demonstrating the difference in nucleosome 
positioning neighboring (B) promoter DHSs and (C) TSS-distal DHSs between actuated and closed fibers. P < 0.001 (Wilcoxon test). (D) Schematic of the boundary 
model for nucleosome placement surrounding regulatory elements. 


element can influence actuation of neighboring 
elements. As 29% of Fiber-seq reads overlap 
multiple neighboring DHSs (Figs. 2E and 3A), we 
next asked whether regulatory DNA actuation at 
one genomic DNA element could influence 
the actuation of neighboring elements on the 
same chromatin fiber. Overall, we found that 
neighboring DHSs were significantly more 
likely to be co-actuated on the same chromatin 
fiber (Fig. 3D). Co-actuation was most strongly 
enriched for the most tightly clustered elements 
(Fig. 3D), indicating that the actuation of reg- 
ulatory DNA accessibility at one distal element 
appears to potentiate accessibility at neigh- 
boring elements in a distance-dependent 
manner, independent of whether both elements 


were bound by the same TF (fig. S7). These 
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results provide a possible physicochemical 
basis for the observed clustering of distal re- 
gulatory elements in animal genomes. More- 
over, they suggest that genetic variants affecting 
regulatory DNA accessibility and function may 
create local knock-on effects in cis, a feature 
not accounted for in current models of the 
architecture and evolution of gene regulation. 


Nucleosome arrays are bounded by actuated 
regulatory DNA 


Nucleosome positioning is fundamental to 
gene regulation and is specified by a combi- 
nation of factors, including DNA sequence, 
the competitive occupancy of TFs, the action 
of nucleosome remodelers, and interactions 
with RNA polymerases (37). Although nucle- 
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osomes surrounding accessible promoters 
and TSS-distal DHSs are generally well posi- 
tioned (38-40), it remains unclear whether this 
positioning results from underlying nucleosome- 
favoring DNA sequences or from a boundary 
condition imposed by accessibility and TF oc- 
cupancy at regulatory DNA. We reasoned that 
the boundary model of nucleosome position- 
ing could be tested directly by comparing nu- 
cleosome positions surrounding a regulatory 
element on overlapping fibers in which the 
regulatory element is in an actuated or open 
state versus overlapping fibers in which the 
regulatory element is in a closed state (as such 
fibers have the same DNA content and should 
differ only in the presence or absence of an 
actuated element). Although nucleosomes 
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CTCF interactions. (A) Genomic loci comparing the relationship between CTCF 
chromatin immunoprecipitation sequencing (ChIP-seq), DNase |-seq, and Fiber-seq 
at CTCF binding sites in K562 cells. The inset shows a CTCF binding site and 


per-fiber m6A methylation. (B) Plot of adenine methylat! 
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surrounding DHSs were collectively well posi- 
tioned (Fig. 4A), analysis of single-fiber data 
showed that well-positioned nucleosomes 
largely originated from fibers in which the 
regulatory element is in an actuated state (Fig. 
4, B and ©), indicating that nucleosome posi- 
tioning at these locations is largely dependent 
on the actuation of regulatory DNA, not the 
DNA sequence itself. As such, nucleosome 
positioning appears to largely result from a 
boundary condition imposed by regulatory 
DNA actuation on individual chromatin fibers 
(Fig. 4D). 


Fiber-seq of human chromatin 


To test Fiber-seq on the human genome, we 
first validated that m6A-MTases can selectively 
mark cell type-specific accessible DNA in 
human cell types (fig. S8A). We then performed 
Fiber-seq on human K562 nuclei, resulting in 
robust demarcation of both accessible regu- 
latory elements and internucleosomal linker 
regions (fig. S8, B to E) with an average cover- 
age of 3.7 high-quality reads per DHS. Consist- 
ent with the findings from Drosophila cells, 
we found that regulatory DNA accessibility in 
K562 cells is predominantly actuated as an 
all-or-none process (fig. S8F), with larger pro- 
moter elements additionally demonstrating 
variably placed co-occupying nucleosomes (fig. 
S8G). Furthermore, neighboring TSS-distal DHSs 
were significantly more likely to be co-actuated 
on the same chromatin fiber (fig. SSH), and 
nucleosome positioning surrounding regula- 
tory DNA was largely dependent on regulatory 
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S8, I and J), indicating that these are likely 
universal effects of regulatory DNA actuation. 


Single-molecule TF occupancy at 
nucleotide resolution 


Analysis of K562 Fiber-seq patterns in ac- 
cessible regulatory elements revealed that m6A- 
MTase methylation was not uniform but rather 
was punctuated by short gaps, consistent with 
nucleotide-precise binding of TFs that mirror 
bulk TF footprints on non-cross-linked chro- 
matin (Fig. 5, A and B, and fig. S9) (47). We 
next explored these single-molecule TF occu- 
pancy events to determine the relationship 
between TF occupancy and regulatory DNA 
actuation and function, focusing on CTCF be- 
cause most of its binding sites lack additional 
co-bound TFs, and CTCF occupancy is considered 
essential for establishing long-range chromatin 
interactions (42). TFs are known to undergo 
rapid exchange at regulatory DNA elements 
(43), yet it remains unclear the extent to which 
regulatory elements can remain actuated during 
the transient absence of TF occupancy (44-46). 
We observed that most of the actuated chro- 
matin fibers overlapping CTCF-bound regula- 
tory elements lacked a CTCF footprint (Fig. 5, 
B and OC), suggesting that regulatory DNA ac- 
tuation can be maintained in the absence of 
immediate occupancy by a gating TF. Notably, 
CTCF elements that preferentially contained 
fibers with a CTCF footprint were more likely 
to participate in long-range chromatin inter- 
actions (Fig. 5D), providing a mechanistic link 
between single-molecule CTCF occupancy and 


DNA accessibility as a boundary condition (fig. 
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footprint or (bottom) have a CTCF footprint. (C) Percentage of fibers overlapping 
CTCF ChIP-seq peaks containing a bulk CTCF DNase | footprint that lack an 
accessible MADs or have an accessible MADs with or without a CTCF footprint. 
(D) Average number of RAD21 ChIA-PET long-range chromatin interactions at CTCF 
binding sites based on the percentage of overlapping fibers that contain a single- 
molecule CTCF footprint. *P value < 0.01 (z-test). 


Outlook 

In conclusion, we have shown that Fiber-seq 
provides a robust and scalable approach for 
stenciling the primary architecture of individual 
chromatin fibers onto their underlying DNA 
templates with high precision and resolution. As 
both sequencing read lengths and throughput 
increase, it should be possible in the near future 
to transcribe and resolve the primary regulatory 
architectures of entire genetic haplotypes by 
simultaneously mapping both the primary genetic 
sequence and overlying chromatin state. As such, 
Fiber-seq has the potential to provide a unifying 
tool for analyzing the gene regulatory impact of 
both rare and common regulatory DNA variation, 
and for resolving extended regulatory alleles. 
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Cryo-EM structure of the human 
cohesin-NIPBL-DNA complex 


Zhubing Shi, Haishan Gao’, Xiao-chen Bai~**, Hongtao Yu'** 


As a ring-shaped adenosine triphosphatase (ATPase) machine, cohesin organizes the eukaryotic genome by 
extruding DNA loops and mediates sister chromatid cohesion by topologically entrapping DNA. How cohesin 
executes these fundamental DNA transactions is not understood. Using cryo-electron microscopy (cryo-EM), 
we determined the structure of human cohesin bound to its loader NIPBL and DNA at medium resolution. 
Cohesin and NIPBL interact extensively and together form a central tunnel to entrap a 72—base pair DNA. NIPBL 
and DNA promote the engagement of cohesin’s ATPase head domains and ATP binding. The hinge domains of 
cohesin adopt an “open washer” conformation and dock onto the STAG1 subunit. Our structure explains 

the synergistic activation of cohesin by NIPBL and DNA and provides insight into DNA entrapment by cohesin. 


ohesin is a multisubunit adenosine tri- 

phosphatase (ATPase) machine with key 

roles in genome organization and chro- 

mosome segregation (7-4). Cohesin me- 

diates the formation of chromatin loops 
and topologically associating domains by 
loop extrusion (5-12). During DNA replica- 
tion, cohesin topologically entraps both sister 
chromatids to establish sister chromatid cohe- 
sion (13, 14), which is critical for accurate chromo- 
some segregation. 

Human cohesin consists of four subunits: 
SMCI, SMC3, RAD21, and either STAG1 or STAG2 
(Fig. 1A). The homologous ATPases SMC1 and 
SMC3 heterodimerize through their hinge do- 
mains. The kleisin subunit RAD21 (Scc1/Mcd1 
in yeast) links their ATPase head domains (HDs) 
to form a ring (5-17). The huntingtin, elonga- 
tion factor 3, A subunit, and TOR (HEAT) repeat 
proteins STAGI and STAG2 (Scc3 in yeast) bind 
RAD21 and provide docking sites for other 
regulators (18, 19). The NIPBL-MAU2 loader 
complex (Scc2-Scc4 in yeast) loads cohesin onto 
chromosomes and associates with chromosome- 
bound cohesin to promote loop extrusion (6, 9, 20). 
The HEAT repeat protein NIPBL (Scc2) alone 


NIPBL 
ATP 
STAG1 RAD21 
Kleisin 
compartment 


can stimulate the ATPase activity of cohesin in the 
presence of DNA and is sufficient for cohesin 
loading onto DNA and loop extrusion (22, 21, 22). 
Chromosome-bound cohesin is dynamic and 
can be released by the releasing factor WAPL 
with the help of the scaffold protein PDS5 
(23, 24). Acetylation of SMC3 by the acetyl- 
transferases ESCO1 and ESCO2 antagonizes 
WAPL-PDS5-dependent cohesin release and 
stabilizes cohesin on chromosomes (25-29). 
The crystal structures of each cohesin sub- 
unit alone or in subcomplexes have been de- 
termined (J5, 17, 18, 30-38). Low-resolution 
negative-stain electron microscopy (EM) maps 
of an engineered human cohesin complex have 
also been reported (39). However, the lack of 
high-resolution structures of large cohesin 
complexes hinders our understanding of how 
cohesin interacts with DNA and how NIPBL 


promotes the loading of cohesin onto DNA. 
Here, we report the cryo-electron microscopy 
(cryo-EM) structure of human cohesin bound to 
the C-terminal HEAT repeat domain of NIPBL 
(NIPBL‘; residues 1163 to 2804) and DNA. This 
structure, together with crystal structures of 
the human SMC1-SMC3 hinge domains, reveals 


how cohesin-NIPBL embraces DNA and estab- 
lishes the basis for NIPBL-DNA-dependent 
activation of cohesin’s ATPase activity. 


Overall structure of human 
cohesin-NIPBL-DNA complex 


NIPBL* is sufficient to stimulate the ATPase 
activity of cohesin in the presence of DNA and 
to promote loop extrusion by cohesin (12). We 
thus reconstituted the cohesin-NIPBL°-DNA 
complex for structural studies. To capture 
cohesin in the ATP-bound state, we used the 
ATPase-deficient SMC1 Glu"’—Gln (E1157Q)/ 
SMC3 E1144Q (cohesin®2) mutant for com- 
plex assembly (fig. S1). We incubated cohe- 
sin®® and NIPBL‘ with the nonhydrolyzable 
ATP analog adenylyl-imidodiphosphate (AMP- 
PNP) and a 72-base pair double-stranded DNA 
(dsDNA), subjected the complex to mild cross- 
linking to stabilize the complex, and performed 
cryo-EM analysis (figs. S2 and S3). 

The three-dimensional (3D) reconstruction 
of the cohesin®®-NIPBL°-DNA complex was 
first performed at an overall resolution of 
4.0 A (fig. $3). To further improve the res- 
olution, we performed a focused refinement 
of the more stable core of the complex, in- 
cluding NIPBL®, RAD21, and the ATPase HDs 
of SMC1-SMC3. The resulting cryo-EM map 
at 3.9 A showed improved density for this 
core, allowing us to build a nearly complete 
model, with the help of crystal structures of 
individual subunits (15, 17, 33, 37) (fig. S4, 
table S1, and movie S1). 

The cryo-EM density for STAG1 was less 
well resolved, presumably because of low 
binding occupancy and structural flexibil- 
ity (fig. $3). Subsequent application of fur- 
ther 3D classification with local angular 
search into six classes identified a single 
class with good density for STAGI (fig. S3). 
Further 3D refinement of the correspond- 
ing particles from this class produced a map 
with an overall resolution of 5.3 A, which 


Fig. 1. Overall structure of the human cohesin-NIPBL°-DNA complex. (A) Schematic representation of the human cohesin-NIPBL complex. The SMC and kleisin 
compartments are indicated. (B) Surface (left) and cartoon (right) representations of the cryo-EM structure of the cohesin-NIPBL°-DNA complex in two different 
views. SMC1, SMC3, RAD21, STAG], NIPBL, and DNA are colored cyan, purple, green, orange, pink, and gray, respectively. 
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Fig. 2. Interactions between NIPBL° and the SMC1-SMC3 heterodimer. (A and B) Cryo-EM map at a resolution of 3.9 A (A) and cartoon illustration (B) of 

the cohesin-NIPBL°-DNA complex. (C) ATPase active sites of the SMC1-SMC3 heterodimer. AMP-PNP is shown as sticks. Conserved motifs in SMC1 and SMC3 HDs 
are indicated. (D) Interfaces between NIPBL° and the SMC1-SMC3 heterodimer. (E) Close-up view of interface |. The NIPBL E-loop is colored purple. (F) Close-up 
view of the electrostatic interactions involving the acetylation acceptor lysines (K105 and K106) of SMC3 at interface |. 


showed improved density for STAGI (fig. 
S5, table S1, and movie S2). The structural 
model of STAGI, generated by homology 
modeling using the crystal structure of 
STAG2 as template (78), can be unambig- 
uously docked into the cryo-EM density on 
the basis of clear secondary structural fea- 
tures (Fig. 1B). 

In this cryo-EM map, we identified addi- 
tional density docked on the outer surface of 
STAGI (fig. S5). On the basis of the overall 
shape and secondary structure features, this 
density could be attributed to the SMC1-SMC3 
hinge domain heterodimer (30) (movie S3). 
Most of the SMC1 and SMC3 coiled coils (CCs), 
large segments of the flexible region of RAD21, 
and the N- and C-terminal unstructured parts 
of STAGI and NIPBL® were completely un- 
resolved. As the shape of each DNA base was 
not clearly defined at 3.9-A resolution, we used 


Department of Pharmacology, University of Texas 
Southwestern Medical Center, Dallas, TX 75390, USA. 
“Department of Biophysics, University of Texas Southwestern 
Medical Center, Dallas, TX 75390, USA. “Department of Cell 
Biology, University of Texas Southwestern Medical Center, 
Dallas, TX 75390, USA. “School of Life Sciences, Westlake 
University, Hangzhou, Zhejiang 310024, China. 
*Corresponding author. Email: xiaochen.bai@utsouthwestern. 
edu (X.-c.B.); hongtao.yu@utsouthwestern.edu (H.Y.) 


Shi et al., Science 368, 1454-1459 (2020) 


26 June 2020 


A-T base pairs, instead of the original DNA 
sequence, for DNA modeling. 

The cohesin-NIPBL°-DNA complex consists 
of three layers that tightly pack against each 
other (Fig. 1B and movie 82). The first layer 
comprises the V-shaped SMC1-SMC3 hetero- 
dimer and the N-terminal helical domain 
(NHD) and C-terminal winged helix domain 
(WHD) of RAD21. NIPBL¢ and a flexible seg- 
ment of RAD21 make up the middle layer. The 
third layer contains STAG], a central segment 
of RAD21, and the SMC1-SMC3 hinge domains. 
Being the main component of the middle layer, 
NIPBL* packs against the SMCI-SMC3 hetero- 
dimer on one side and STAGI on the other. 

The highly extended protein RAD21 tethers 
all three layers through interactions with SMC1- 
SMC3, NIPBL‘, and STAGI (Fig. 1B). RAD21 in- 
teractions with SMC1, SMC3, and STAGI are 
consistent with those previously reported for 
homologous subcomplexes (15, 17, 18). DNA 
binds at a central tunnel perpendicular to the 
three layers and bends ~45° at the NIPBL°- 
SMC3 interface (Fig. 1B). It contacts all three 
layers and stabilizes the entire complex. 


Structure of the SMC1-SMC3 heterodimer 


Both SMC1 and SMC3 ATPase HDs are com- 
posed of N- and C-lobes and contain key ele- 


ments for ATP binding and hydrolysis. In the 
structure of the cohesin-NIPBL°-DNA com- 
plex, SMC1 and SMC3 HDs form a heterodimer 
and bind two AMP-PNP molecules at the dimer 
interface through conserved motifs (Fig. 2, A 
to C, and fig. S6A). Consistent with previous 
results (15, 17), RAD21 interacts with SMC1 
and SMC3 through its WHD and NHD, respec- 
tively (fig. S6, B to D). 

The major parts of SMC1 and SMC3 CCs are 
missing in the EM map; however, a small CC 
segment connected to the SMC3 HD can be 
modeled (fig. S6, B and C). The distal end of 
this CC segment folds into a short four-helix 
bundle because of breaks in the C-terminal helix 
of the CC. The positions of these breaks cor- 
respond to the “SMC joint,” which was first de- 
fined in yeast Smc3 and bacterial Smc proteins 
(15, 40) (fig. S6, C and D). The SMC joint is more 
likely to undergo bending motions, as only one 
helix in the CC is contiguous at this position. 


Interactions between NIPBL° and cohesin 


NIPBL¢ adopts a U-shaped fold, which con- 
tains 24 HEAT repeats (R1 to R24) and a heli- 
cal insert domain (HID) (33, 37) (Fig. 2B and 
fig. S7A). A continuous and narrow density was 
observed in the central cleft of the “U” (fig. 
S7A). We previously showed that a Chaetomium 
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thermophilum Scc1 fragment (residues 126 to 
230) interacts with Scc2 in vitro (33). This re- 
gion of Sccl corresponds to residues 120 to 194 
of human RAD21. On the basis of clearly de- 
fined side-chain densities, we could assign the 
density in the NIPBL cleft to residues 154 to 
171 in RAD21 (fig. S7A). Extra unassigned den- 
sities extending from this RAD21 segment to 
the bottom of the NIPBL “U” likely also belong 
to RAD21 (Fig. 2A and movie S1). RAD21 binds 
to NIPBL through both electrostatic and hy- 
drophobic interactions (fig. S7, B and C). The 
relevance of the NIPBL-RAD21 interface is sup- 
ported by previous mutagenesis data on the 
C. thermophilum proteins (33). 

NIPBLS forms three interfaces with SMC1- 
SMC3 (denoted as interfaces I, II, and III) 
(Fig. 2D). Interface I is mainly formed by both 
SMC1 and SMC3 HDs and the C-terminal arm 
of the “U” in NIPBL—i.e., HEAT repeats R18 
to R21. NIPBL simultaneously contacts the 
SMC1 HD C-lobe and SMC3 HD N-lobe and 
strengthens the heterodimeric engagement of 
the two HDs through an extensive interaction 
network (Fig. 2E and fig. $8, A and B). This 
network involves the SMC1 F-loop, which 
resembles the W-loop of SMC4 (4D), the SMC3 
R-loop, and the loop connecting R18 and R19 
in NIPBL‘, which is termed the E-loop. No- 


SMC1 
Closed (2WD5) 


Fig. 3. Interactions among STAGI-RAD21, NIPBL°, and the SMC1-SMC3 
hinge heterodimer. (A) Cartoon representation of the subcomplex of 
STAGI-RAD21, NIPBL®, and the SMC1-SMC3 hinge in two different views. 
The EM map of these components (colored gray) is overlaid on the cartoon 
in the left panel. (B to E) Structures of the SMC1-SMC3 hinge heterodimer 
in the closed conformation (PDB ID 2WD5) (B), the north-open conformation 
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tably, Lys! and Lys'®° in the N-lobe of SMC3 
HD, which are the acetylation sites of ESCO1 
and ESCO2 (25-27), appear poised to form 
electrostatic interactions with Glu’*” and Glu’? 
in RU of the NIPBL® N-terminal arm (Fig. 2F). 
Acetylation of these two lysines is expected to 
neutralize their positive charges, disrupt fa- 
vorable electrostatic interactions with NIPBL, 
and inhibit cohesin loading and possibly loop 
extrusion. Indeed, Ecol limits loop expansion 
in yeast (42). Mutations of the SMC3 deacety- 
lase HDAC8 have been linked to human de- 
velopmental diseases termed cohesinopathies 
(43). HDAC8-deficient cells exhibit elevated 
levels of SMC3 acetylation and deficient cohesin 
loading at certain genomic loci. The functional 
importance of Glu’®*? and Glu of NIPBL re- 
mains to be validated. 

Interface IT is between the SMC1 CC and the 
C-terminal region of NIPBL® (Fig. 2D and fig. 
S8C). It mainly involves hydrogen bonding 
and hydrophobic interactions. Interface II is 
between the SMC3 joint and the N-terminal 
region of NIPBL°—namely, HEAT repeats R2 to 
R6—and also involves the RAD21 NHD (Fig. 2D 
and fig. S8D). This interface comprises exten- 
sive electrostatic and hydrophobic interactions. 
Compared with the Scc2 HEAT repeat alone 
(33), the N-terminal arm of NIPBL* undergoes 


f RAD21 


at 1.20). 


North open (DNA-bound) 


a large conformational change upon binding to 
cohesin, which may facilitate their interaction. 
Overall, NIPBL associates with both SMC1 
and SMC3 HDs and CCs to stabilize the HD- 
engaged conformation of SMC1-SMC3 and 
helps to create a closed SMC compartment for 
DNA entrapment. 

Similar to NIPBL, STAGI1 also adopts a 
U-shaped fold. In the cohesin-NIPBL°-DNA 
complex, the N-terminal arm of the “U” in 
NIPBL packs against the N-terminal arm of 
the “U” in STAGI1 in an antiparallel manner 
(Fig. 3A). A major interface is formed by HEAT 
repeats R8 to R10 of STAGI and R1 to R5 and 
HID of NIPBL (fig. S9A). At a second, smaller 
interface, two long helices from R1 and R2 
in the N-terminal region of STAG1 make con- 
tacts with the HEAT repeat R13 in NIPBL 
(Fig. 3A and fig. S9B). Through performing 3D 
classification, we observed short-range hinge 
motions of the N-terminal region of STAG1 
relative to NIPBL (fig. S9C), suggesting that 
this interface is fluid. 


The SMC1-SMC3 hinge domains directly 
contact STAG1 


The SMC1-SMC3 hinge heterodimer makes 
direct contact with the bottom part of the “U” 
of STAGI (Fig. 3A). This unexpected interaction 


South open (alone) 


in the cryo-EM structure of the cohesin-NIPBL°-DNA complex (C), the 
north-open conformation in the crystal structure of the hinge with short 
CCs bound to ssDNA (D), and the south-open conformation in the 
crystal structure of the hinge (E). The electron density of the two deoxy- 
thymidines (dT) is shown as gray mesh (2mFo-DFc map contoured 
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is mainly mediated by the SMC1 hinge and 
HEAT repeats R8 and R9 of STAG] (fig. SIOA). 
Consistent with these structural observations, 
a previous study showed that the hinge do- 
main of Psm1-Psm3, the SMC1-SMC3 homolog 
in fission yeast, can bind to Psc3, the fission 
yeast homolog of STAG] and STAG2 (44). The 
N-terminal region of NIPBL‘ is also located in 
close proximity to the SMC1 hinge (Fig. 3A). 
An extra unassigned density extends from the 
N-terminal region of NIPBL¢ and inserts into 
a surface groove connecting the SMC1 hinge 
and STAGI (fig. S10B). Thus, there may also 
be interactions between the SMC1-SMC3 
hinge heterodimer and NIPBL. 

In the reported structure of the SMC1-SMC3 
hinge heterodimer (30), the two subunits 
are tightly packed through two distinct in- 
terfaces (termed north and south interfaces), 
forming a donut-shaped structure with a cen- 
tral pore (Fig. 3B). The north interface of the 
hinge heterodimer is disrupted in the cohesin- 
NIPBL°-DNA complex, producing a half-open 
conformation that resembles an open washer 
(Fig. 3C). The hinge domains of SMC com- 
plexes, including cohesin, have been shown to 
bind single-stranded DNA (ssDNA) or dsDNA 
(45-47). We did not, however, identify any 


Fig. 4. DNA binding by cohesin and NIPBL®. (A) Structure of the cohesin- 
NIPBL°-DNA complex, with protein subunits shown in cartoon representation and 
the EM map of DNA shown as gray surface. (B) Interface between DNA and the 
SMC1-SMC3 heterodimer. (C) Close-up view of the DNA and NIPBLS interactions 
involving the acetylation acceptor lysines of SMC3. (D) Interactions between DNA 
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density close to the SMC1-SMC3 hinge that 
could be attributed to DNA. 

We next determined the crystal structures 
of the isolated SMC1-SMC3 hinge heterodimer 
in the presence of DNA. The structures of 
the two different versions of the SMC1-SMC3 
hinge heterodimer adopt different open con- 
formations: one with the north interface 
open and the other with the south interface 
open (Fig. 3, D and E, and table S2). The 
conformation with the north interface open 
closely resembles that observed in the cohesin- 
NIPBL‘°-DNA complex (Fig. 3, C and D). Thus, 
both interfaces of the SMC1-SMC3 hinge are 
intrinsically dynamic and can undergo close- 
open transitions. Whether these transitions 
are coupled to the ATPase cycle of cohesin or 
further regulated by binding to STAG1 or DNA 
remains unknown. 

The crystal structure of the SMC1-SMC3 
hinge with the north interface open contained 
a two-nucleotide ssDNA bound to the outer 
surface of the SMC1 hinge (Fig. 3D and fig. 
S10C). The open conformation of the hinge in 
the cohesin-NIPBL°-DNA complex is likely ca- 
pable of binding ssDNA. The open hinge may 
act to capture ssDNA and then allow its en- 
try into the SMC chamber through the cen- 
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tral pore (48, 49). Notably, STAG1 and ssDNA 
contact overlapping surfaces in the SMC1 
hinge (fig. S10D), suggesting that they might 
compete for hinge binding. The potential 
function of this hinge surface remains to be 
determined. 


DNA binding by cohesin and NIPBL 


DNA is bound at the central tunnel of the 
cohesin-NIPBL© complex, contacting all sub- 
units (Fig. 4A). The major DNA binding pocket 
is formed by the heterodimer of SMC1-SMC3 
HDs, NIPBL, and the RAD21 NHD (which is 
bound to the CC region of SMC3) (Fig. 4B). 
DNA is tightly gripped by the V-shaped SMC1- 
SMC3 HD heterodimer, in a manner similar 
to DNA binding by the DNA repair protein 
Rad50 (50). The R-loop and the succeeding 
6 sheet in the N-lobe of both SMC1 and SMC3 
are the major sites for DNA binding (fig. S11). 
The acetylation sites Lys’®’ and Lys'°° in SMC3 
that contact NIPBL are also located in close 
proximity to the DNA and may contribute to 
DNA binding (Fig. 4C). Acetylation of Lys’®’ 
and Lys'°° of SMC3 may weaken DNA binding 
to the SMC3 HD and block the ability of DNA 
to stimulate the ATPase activity of cohesin, as 
previously proposed (44). 


SMC3 
AMP-PNP 


and NIPBL®. NIPBL mutations in patients with Cornelia de Lange syndrome are 
highlighted as spheres. (E) Interactions between DNA and STAGI. The DNA 
binding element of STAGI is colored purple. (F) Structural comparison of SMC1 
and SMC3 HDs bound to DNA. AMP-PNP and the conserved arginines in the 
R-loops of SMC1 and SMC3 are shown as sticks. 
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Fig. 5. DNA entrapment by cohesin. (A) Schematic representation of the cohesin-NIPBL complex in 
HD-engaged and HD-disengaged conformations. The HD-engaged ring conformation might exist in an equilibrium 
with a folded conformation with the hinge domains docked on STAGI. The SMC and kKleisin compartments 

of HD-engaged cohesin are indicated. These compartments merge in the HD-disengaged conformation. Owing 
to the lack of cryo-EM density for major parts of SMC CCs, it is unclear whether the interaction between 
STAG] and SMC hinge domains is intramolecular (depicted) or intermolecular (not depicted). (B) Model for 
cohesin loading onto DNA. DNA is captured by NIPBL and STAG] in the HD-disengaged cohesin. Upon 

ATP binding, SMC1 and SMC3 HDs engage, creating a DNA binding site between their CCs and an interface 


for NIPBL binding. RAD21-tethered NIPBL binds the S 
forms the central tunnel to entrap DNA. 


The HEAT repeat subunit Ycgi in the re- 
lated condensin complex directly interacts with 
DNA (52D. Similarly, the HEAT repeat proteins 
in cohesin, NIPBL and STAGI, serve as major 
DNA binding receptors (Fig. 4, D and E). 
DNA binding to NIPBL buries a large solvent- 
exposed surface of NIPBL (fig. $12A). Several 
NIPBL missense mutations found in patients 
with Cornelia de Lange syndrome (52) affect 
residues in this DNA binding site, as well as those 
in the RAD21 binding site (Fig. 4D). Mutations of 
yeast Scc2 residues that correspond to several 
DNA binding residues in NIPBL are lethal (22). 
These findings confirm the functional impor- 
tance of this NIPBL-DNA binding interface. 

STAGI1 binds to DNA through the N-terminal 
part of a helix in R8 and an extended loop that 
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C1-SMC3 heterodimer and, together with STAGI, 


follows the helix (Fig. 4E and fig. S12B). The 
DNA binding interface of STAG] in our struc- 
ture is different from that in the reported 
structure of the yeast Scc3-DNA complex (53) 
(fig. S12, C to F). Most DNA binding inter- 
faces of cohesin-NIPBL‘ are mediated by elec- 
trostatic interactions with the phosphate 
backbone. This feature explains the salt sen- 
sitivity and lack of sequence specificity in 
initial DNA binding by the cohesin-NIPBL 
complex (11, 12, 21). 


Cohesin activation by NIPBL and DNA 


SMCI1 and SMC3 belong to the family of ATP- 
binding cassette ATPases (54). They bind two 
ATP molecules at the dimer interface of their 
ATPase HDs (Fig. 2C). SMC1-SMC3 HD hetero- 


dimerization and ATP binding are interdepen- 
dent events. NIPBL simultaneously contacts 
both SMCI1 and SMC3 HDs (Fig. 4A), thus pro- 
moting their heterodimerization. Likewise, 
DNA also bridges both HDs to stabilize the 
heterodimer (Fig. 4B). Thus, one mechanism 
by which NIPBL and DNA activate the ATPase 
activity of cohesin is to promote and stabilize 
the ATP-binding, HD-engaged state of the 
SMC1-SMC3 heterodimer. 

DNA and NIPBL trigger both localized and 
domain-specific conformational changes in 
SMCI1 and SMC3 HDs. NIPBL and DNA con- 
tact and organize the loops involved in ATP 
binding. For example, DNA directly contacts 
the R-loop of SMC1 HD and organizes this 
loop for ATP binding at the SMC1 site (Fig. 4, 
B and F). NIPBL and DNA also induce a con- 
formational change of the SMC3 HD and CC. 
Compared to yeast Smc3 bound to Sccl NHD 
(15), the N-lobe of SMC3 HD in the intact 
complex undergoes a rotation relative to its 
C-lobe and CC (fig. S6D). This rotation po- 
sitions the N-lobe closer to the SMC1 HD. 
The conformations of SMC1 and SMC3 HDs 
observed here resemble that of an engineered 
yeast cohesin subcomplex (fig. S6E) (55). The 
remodeling of the active sites by NIPBL and 
DNA may also stimulate the ATPase activity 
of cohesin. 


Implications for cohesin loading 


SMC complexes can adopt an open ring-like 
conformation with engaged HDs or a rod- 
shaped conformation with disengaged HDs 
(39, 40, 56-58) (Fig. 5A). Our structure cap- 
tures the HD-engaged conformation with an 
open SMC compartment. DNA binds at the 
top of the engaged SMC1-SMC3 HDs and 
between the CCs—i.e., it is trapped inside the 
SMC compartment. The two HEAT repeat pro- 
teins NIPBL and STAGI further wrap around 
DNA, creating a central tunnel. This confor- 
mational state of the cohesin-NIPBL complex 
does not permit long DNA (e.g., eukaryotic 
chromosomes) to enter or escape from this 
tunnel, without undergoing conformational 
rearrangements. Thus, our structure of the 
cohesin-NIPBL°-DNA complex does not cap- 
ture the initial docking complex but rather rep- 
resents a postloading state. We propose that 
DNA is first recruited by NIPBL and STAGI 
to cohesin in the HD-disengaged state (Fig. 
5B). Upon ATP binding, SMC1 and SMC3 HDs 
engage to trap the DNA inside the SMC com- 
partment. NIPBL and STAGI then wrap around 
the DNA to form the central tunnel and further 
entrap the DNA. 

The cohesin-NIPBL°-DNA complex has a 
layered architecture with multiple DNA bind- 
ing sites, which suggests an inchworm model 
for DNA translocation and loop extrusion. 
In such a model, the ATP-bound, engaged 
SMC HDs together with NIPBL constitute 
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a strong DNA binding site. Once ATP is hydro- 
lyzed, the SMC HDs disengage and loosen 
their grips on DNA. SMC1 and SMC3 HDs 
redimerize upon ATP binding and, together 
with NIPBL, capture a downstream segment 
of DNA. The repeat of such a cycle could lead 
to processive translocation of cohesin on DNA. 
SMC3 acetylation, displacement of NIPBL by 
PDS5, and CTCF binding antagonize loop ex- 
trusion (19, 22, 42). 

Sister chromatid cohesion requires the topo- 
logical entrapment of both sister chromatids 
by cohesin (14). The topological loading of 
cohesin onto DNA involves the opening of any 
of the three interfaces: SMC1-RAD21, SMC3- 
RAD21, or the hinge. Our finding that the 
SMCI1-SMC3 hinge can adopt two types of 
open conformations raises the possibility that 
the hinge might be an entry gate for DNA. 
Consistent with this notion, mutations of cer- 
tain basic residues that line the central pore 
of the hinge disrupt sister chromatid cohesion 
in yeast (14, 30). 


Outlook 


In this study, we determined the cryo-EM 
structure of the intact human cohesin in com- 
plex with its loader NIPBL and DNA, catching 
the first glimpse of the overall architecture 
of this SMC complex in the DNA-bound state. 
Our structure explains a large body of availa- 
ble biochemical findings, provides insight into 
the mechanisms of cohesin loading, and offers 
a blueprint for further functional and mecha- 
nistic dissection of cohesin and related molec- 
ular machines. 
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Mechanisms of OCT4-SOX2 motif readout 
on nucleosomes 


Alicia K. Michael**, Ralph S. Grand”, Luke Isbel'*, Simone Cavadini, Zuzanna Kozicka‘, 
Georg Kempf", Richard D. Bunker'}, Andreas D. Schenk’, Alexandra Graff-Meyer’, Ganesh R. Pathare?, 
Joscha Weiss?, Syota Matsumoto’, Lukas Burger*’, Dirk Schiibeler’7+, Nicolas H. Thoma't 


Transcription factors (TFs) regulate gene expression through chromatin where nucleosomes restrict 
DNA access. To study how TFs bind nucleosome-occupied motifs, we focused on the reprogramming factors 
OCT4 and SOX2 in mouse embryonic stem cells. We determined TF engagement throughout a nucleosome at 
base-pair resolution in vitro, enabling structure determination by cryo-electron microscopy at two preferred 
positions. Depending on motif location, OCT4 and SOX2 differentially distort nucleosomal DNA. At one position, 
OCT4-SOX2 removes DNA from histone H2A and histone H3; however, at an inverted motif, the TFs only induce 
local DNA distortions. OCT4 uses one of its two DNA-binding domains to engage DNA in both structures, 
reading out a partial motif. These findings explain site-specific nucleosome engagement by the pluripotency 


factors OCT4 and SOX2, and they reveal how TFs distort nucleosomes to access chromatinized motifs. 


ranscription factors (TFs) regulate gene 
expression and govern cell identity by in- 
teracting with specific sequence motifs. 
OCT4 and SOX2 serve as reprogram- 
ming TFs (7) that cooperate as critical 
mediators of pluripotency (2-4). Although 
they are not obligate heterodimers in solu- 
tion, the combined OCT4-SOX2 motif drives 
OCT4-SOX2 complex formation (5, 6). 

Chromatin restricts DNA access (7, 8), but 
a specialized subset of TFs, termed pioneer 
factors, can engage chromatinized motifs to 
trigger cell-fate changes (9). Several TFs have 
been shown to bind motifs embedded in nu- 
cleosomes in vitro (J, 10, 11); however, the 
nucleosome architecture, with histones H2A, 
H2B, H3, and H4 and its two DNA gyres (72), 
restricts TF access to >95% of nucleosomal 
DNA (13). Two extreme scenarios for nucleo- 
somal TF-engagement have been put forward: 
TF binding without changing the nucleosomal 
architecture (10, 14) or TF-mediated changes 
to the nucleosome by distorting the histone 
core, looping the DNA, or taking advantage 
of nucleosome unwrapping dynamics at the 
entry-exit sites (15, 16). 

OCT4 binding is predicted to be incompati- 
ble with the nucleosome architecture on the 
basis of its engagement with free DNA (17-19), 
although partial motifs have been identified 
where OCT4 engages only a portion of its 
binding site to maintain nucleosome integ- 
rity (1). Despite being critical for genome 
regulation, the structural and mechanistic 
principles governing nucleosome engage- 
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ment by single or multiple TFs have yet to be 
determined. 


SeEN-seq reveals preferential binding of 
OCT4-SOX2 to nucleosomal entry-exit sites 


Each rotational and translational setting of a 
motif around a nucleosome places the TF in a 


AOCT4-SOX2 motif nucleosome gel shift assay 


next-generation 


different histone-DNA environment. To assess 
all potential TF-binding registers on a nucleo- 
some, we built on approaches that examine 
select loci (20, 21) and developed selected 
engagement on nucleosome sequencing 
(SeEN-seq), which measures the relative 
affinity of one or multiple TFs for each nucleo- 
somal register in parallel (Fig. 1A). We focused 
on OCT4 and its dependence on SOX2 because 
both factors show strong co-occupancy at tar- 
get genes in vivo (22, 23), and the OCT4-SOX2 
protein-protein interface is required for pluri- 
potency (24). The canonical OCT4-SOX2 motif 
(matrix ID MA0142.1) (25) was tiled at 1-base 
pair (bp) intervals through a 601-nucleosome 
positioning sequence (26). The sequence li- 
brary was then assembled into octameric nu- 
cleosome core particles (NCPs), incubated with 
the TF(s), and subjected to electrophoretic 
mobility shift assay (EMSA). TF-bound and 
unbound nucleosome complexes were excised 
and subjected to next-generation sequencing 
(NGS), which generated single molecule counts 
that approximate motif affinity as a function 
of position. 

Arange of protein concentrations of either 
OCT4, SOX2 (residues 37 to 118), or OCT4-SOX2 
together were assessed, with a total of 4752 
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Fig. 1. SeEN-seq identifies preferred binding sites of OCT4 and OCT4-SOX2 across the nucleosome. 
(A) Principle of SeEN-seq. A library of TF motif-containing nucleosome positioning sequences is assembled into 
nucleosomes and incubated with TF(s). TF-bound and unbound nucleosome complexes are separated by 
EMSA and sequenced, revealing position-specific enrichments. Stars indicate a specific example sequence at 
each step of the assay. (B) SeEN-seq enrichment (n = 3 replicates) for nucleosome pool with or without TFs. 
Further details are provided in fig. SIC. Motif position is indicated by superhelix location (SHL) that describes 
where the minor groove faces away (SHLs +1, +2, etc.) or toward (SHLs +15, +2.5, etc.) the histone octamer. 


(C) Autocorrelation analysis of OCT4 enrichment; dashed 


lines indicate 95% confidence interval. (D) DNA 


nucleosome unwrapping energy (15) versus OCT4 and OCT4-SOX2 SeEN-seq enrichment profile. Arrows indicate 
enriched positions in OCT4. DG, delta Gibbs free energy; KBT, 0.6 kcal/mol, where T = 300 K. 
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conditions measured with high reproducibility 
(Fig. 1B and fig. S1, A to C). The trends in 
OCT4 or SOX2 binding at selected positions 
were validated by fluorescence polarization 
measurements (fig. S2, A to E). Previously, 
two OCT4 motif locations were tested and 
found to provide similar OCT4 access to the 
nucleosome-embedded motif (27). Although 
this was recapitulated in SeEN-seq, our com- 
prehensive examination of all motif locations 
reveals clear OCT4: preference for nucleosomal 
DNA entry-exit sites as well as discrete prefer- 
ential binding sites ~1 to 3 bp in width throughout 
the nucleosome (Fig. 1B and fig. SIC). SOX2 shows 
less differential enrichment in SeEN-seq, in line 
with published data (70), with some degree of 
preferred binding toward the entry-exit sites 
and near the dyad (Fig. 1B and fig. SIC). Given 
the small enrichment amplitudes for SOX2 
alone, we focused on the robust and differ- 
ential binding activity seen for OCT4 and how 
it is affected by SOX2. OCT4 and SOX2 cooperate 
strongly, which results in up to 650-fold in- 
creased binding compared with that caused 
by OCT4 alone (fig. S1C). This effect is most 
evident at entry-exit sites with weaker coop- 
erativity observed at internal sites (fig. SIC). 
Although OCT4-SOX2 binding appears 
roughly symmetrical across the dyad, for OCT4 
alone, enrichment in the right half of the nu- 
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Fig. 2. Cryo-EM structure of OCT4-SOX2-NCPS#“-6 complex. (A) Domain 
schematic of OCT4 and SOX2 constructs. GFP, green fluorescent protein; TAD, 
transactivation domain. (B) Cryo-EM map of OCT4-SOX2-NCPS#*~ at 3.1-A resolution. 
(C) Model of the OCT4-SOX2-NCPS#*> complex. The H3 N-terminal helix and tail 
(shown as spheres) stabilize DNA at the entry-exit in a canonical nucleosome (31). The 
presence of OCT4-SOX2 increases the distance between H3 (Arg“) and the nearest 
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cleosome [superhelix locations (SHLs) +4 to 
+6.5] is more pronounced compared with that 
in the left (SHLs —4 to —6.5) (Fig. 1B and fig. 
S1C). A notable difference is the motif orien- 
tation on either side of the dyad; on the left, 
the OCT4 portion of the motif is closest to 
the dyad, whereas on the right it is oriented 
toward the entry-exit site of the nucleosome. 
Positions enriched for OCT4 alone and OCT4- 
SOX2 exhibited stronger binding at discrete 
sites with 10-bp periodicity across the nucleo- 
some (Fig. 1C and fig. $2, F and G). Both OCT4 
and OCT4-SOX2 show a trend of stronger 
binding at the entry-exit site than at the dyad 
(Fig. 1D). This would be expected if OCT4 and 
OCT4-SOX2 binding was facilitated by nucleo- 
somal breathing, which is more pronounced 
toward the nucleosomal entry-exit sites (15). 
However, binding is not simply a function of 
nucleosomal breathing, because in the pres- 
ence of OCT4 alone, motifs on the left half of 
the nucleosome (SHLs -6.5 to —6.0) are not 
bound across 5 bp, whereas adjacent motifs more 
proximal to the dyad are tightly bound (Fig. 1, B 
and D, and fig. S2H). These entry-exit site loci 
are unbound by OCT4 alone, but they are 
bound cooperatively with SOX2 (Fig. 1D and 
fig. S2H). Thus, spatial orientation of the motif, 
cooperativity, and nucleosomal breathing dy- 
namics all govern OCT4-SOX2 binding. 
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Structure of OCT4-SOX2 bound at SHL -6 
shows DNA release from the histone core 

To dissect mechanisms of nucleosome engage- 
ment, we performed structural studies with 
OCT4-SOX2, which co-bind key target genes 
in vivo and have a cooperative effect in vitro 
(fig. $2, I to L) (22). For structural studies, we 
focused on sites that show discrete OCT4 SeEN- 
seq enrichment and cooperative binding in the 
presence of SOX2 (fig. S3A). A site 57 bp from 
the dyad (SHL -6) enabled structure determi- 
nation from ~90,000 particles at an overall 
resolution of 3.1 A (Fig. 2, A and B, and fig. 3). 
The nucleosome core and discrete domains of 
OCT4 [POU-specific (POUS)] and SOX2 [high- 
mobility group (HMG)] were well resolved (see 
fig. S3F for local resolution), which allowed 
conservative refinement with reference re- 
straints to existing high-resolution OCT4 and 
SOX2 DNA-bound crystal structures (Fig. 2, B 
and C; fig. $4; and fig. S5, A to D) (18, 19). 

In the OCT4-SOX2-NCP*"™* structure, OCT4: 
and SOX2 together remove the DNA from core 
histones (Fig. 2, C and D). OCT4 has a bipartite 
DNA-binding domain composed of a POUS and 
POU-homeodomain (POUHD) separated by 
17 residues, whereas SOX2 utilizes an HMG 
domain (/8). When nucleosome-bound, the 
OCT4-POUS and SOX2-HMG DNA-binding 
domains engage major and minor grooves, 


Free DNA 
SOX2 OcT4 
HMG POUs POUHD 


Motif CTTTGTTATGCAAAT 


HMG POUs H2A:B 
Nucleosome 


DNA base (T143) by 12.7 A as compared with the unbound nucleosome. (D) Dyad 
view of the OCT4-SOX2-NCPS"* structure. (E) In the depicted OCT4-SOX2 
arrangement, a model of POUHD engagement with its motif shows significant clash 
with the H2A:H2B dimer. Overlay of free DNA-bound structure (PDB: 104X) with the 
nucleosome-bound structure, aligned on the DNA (see also fig. SSE). (F) Schematic 
of free DNA versus the observed nucleosome-binding mode of OCT4-SOX2. 
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Fig. 3. OCT4-SOX2 lifts the entry-exit DNA 
away from the histone core. (A) Comparison 
of the unbound nucleosome DNA (blue) 

with the OCT4-SOX2 bound nucleosome 
structure (gray). The DNA is kinked 

~90° away from the histones. Residues at 
the OCT4-DNA interface are shown as sticks. 
POUS motif nucleotides are shown as 

ibose and base rings. (B) SOX2 kinks the 
nucleosomal DNA away from the histones. 
SOX2 uses conserved (Phe*® and Met*?) 
esidues to intercalate a TT base step, 
indicated by arrows (see also fig. S8A). 
OCT4 is removed for clarity. (€) Difference 
in DNasel digestion across the nucleosome 
in the presence of OCT4 or OCT4-SOX2 
(right). Zoom-in view around the OCT4-SOX2 
motif (left). 


Fig. 4. OCT4-SOX2 bound at SHL +6 induces 
localized DNA distortion. (A) Depiction of 
OCT4-SOX2 SeEN-seq profile illustrating 
symmetric positions used for structure 
determination (arrows). As the OCT4-SOX2 
motif is tiled across the nucleosome, the 
orientation of the proteins in relation to the 
dyad is inverted. (B) OCT4-SOX2-NCPSHL*® 
model. (€) Details of SOX2-induced DNA 

kink. SOX2 binding locally distorts DNA near 
SHL +5. The C terminus of SOX2 approaches 
the histone core when bound in this orientation, 
distorting DNA away from the stabilizing 

H2B residues [N terminus (Arg*’) and al]. 
The 7-A movement indicated is calculated 
between the phosphate backbone at Cys!@° 

in the bound and unbound DNA. SHLs are 
shown (SHLs +4.5 and +5.5). (D) Comparison 
of the DNA trajectory in the OCT4-SOX2- 
NCPS*“*6 (SHL +6) (top) versus OCT4-SOX2- 
NCPSH--6 (SHL -6) (bottom) structures. The 
DNA trajectory at the entry-exit site of the 
TF-bound structures is indicated with an arrow. 


respectively, with protein-DNA interactions con- 
sistent with those previously seen for the indi- 
vidual OCT4-POUS domain and SOX2 on free 
DNA (fig. S5E). The DNA remains attached 
and straightened around the OCT4 site but is 
detached around the SOX2 motif (Fig. 2 and 
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fig. S5F). SOX2 kinks the DNA and, together 
with OCT4, synergistically releases DNA from 
the core histones (movie S1). OCT4-SOX2 forms 
no discernable histone contacts, with the DNA 
separating them from the nucleosome core 
(Fig. 2, C and D). Density for the OCT4 trans- 
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unbound 
DNA 


activation domains was not observed, which is 
consistent with similar nucleosome-affinity 
measurements for full-length OCT4 and OCT4 
DNA-binding domain only (fig. S5G). 

OCT4 recognizes a partial motif, engag- 
ing DNA with its POUS domain, whereas the 
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Fig. 5. The HMG-POUS partial motif is 
bound in vivo and opens chromatin. 

(A) Fraction of the top thousand motifs, 
ranked by match to the position weight motif, 
that are bound by Oct4 (2-fold ChIP-seq 
enrichment). Mean and individual values from 
two separate datasets are shown (33, 34). 
Significant enrichment is detected for full and 
partial motifs compared with loci without a 
motif (lowest 1000 genomic matches to the 
Oct4-Sox2 motif). **P < 0.01; ****P < 
0.0001. (Pearson's chi-square test, indicated 
for dataset with least significance). (B) Sche- 
matic representation of targeted motif 
insertion and Oct4 ChIP-qPCR enrichment 
at the ectopic insertion locus and endoge- 
nous control (Cont.) locus. *P < 0.05; 

error bars indicate SEM of at least two 
biological replicates. (C) Representative 
genomic region (chr9: 115723244 to 
15725243) that shows binding of Oct4 to 
a partial HMG-POUS motif (top) and 
Oct4-dependent chromatin accessibility 
(bottom). Expr., Oct4-expressing cells; KD, 
Oct4-knockdown cells. (D) Graphical repre- 
sentation of the nucleosome distortion 
induced by OCT4-SOX2 when bound 
at dyad-symmetric sites. 


POUHD is not engaged. On free DNA, both 
POU domains engage the major groove span- 
ning 8 bp on opposite sides of the DNA and 
would clash with either the histones or DNA 
gyres at all locations on the nucleosome (79, 28) 
(Fig. 2E and fig. S6A). To stabilize the OCT4- 
SOX2-NCPS"™~* complex for imaging, GraFix 
cross-linking was necessary and a cross-link 
was evident between H2A and H2B (fig. S6B) 
(29). To test if POUHD access was blocked by 
the cross-linking of histones, we solved an 
~4-A map of the non-cross-linked OCT4-SOX2- 
NCPS#“-6 nucleosome (fig. $7, A to E), which 
resulted in a largely indistinguishable mod- 
el (root mean square deviation, 1.3 A; fig. $7, 
F and G). As the OCT4 POUHD motif is oc- 
cluded by H2A-H2B, the binding mechanism 
involving only the OCT4 POUS is consist- 
ent with partial motif engagement (Fig. 2F) 
(1). Partial motif recognition, however, does 
not necessarily render TF binding compat- 
ible with the nucleosomal DNA structure 
(fig. SSF). 

In the context of the nucleosome, SOX2 com- 
petes with histones for DNA binding and kinks 
DNA by ~90° at SHL -6.5 away from the his- 
tones, similar to HMG domains on free DNA 
(Fig. 3, A and B, and fig. S8A) (78). This is 
accomplished by intercalation of the SOX2 
Phe*® and Met*® wedge at the TT base step 
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(18). Variant SOX2 motifs that lessen distor- 
tion of the DNA induced by SOX2 have been 
described and may facilitate nucleosome- 
compatible dyad binding (J, 10, 30). How- 
ever, with the canonical SOX2 motif used here, 
SOX2 facing the entry-exit site is not compati- 
ble with the canonical nucleosome architec- 
ture and triggers DNA release (37). 

Despite disruption of histone-DNA contacts 
at SHL -6.5, no histone rearrangements were 
observed after OCT4-SOX2 binding and DNA 
release across 14 bp (fig. S8, B and C). To verify 
OCT4-SOX2 binding and DNA release using 
an orthogonal approach, we performed deoxy- 
ribonuclease I (DNasel) footprinting in the 
absence and presence of OCT4 or OCT4-SOX2. 
This revealed increased digestion at the nu- 
cleosomal entry-exit site (SHLs —7 to —5.5) in 
the presence of OCT4-SOX2, in line with DNA 
detachment and partial motif binding (Fig. 
3C and fig. S9, A and B). Notably, OCT4 alone 
triggers DNA release, and OCT4 and OCT4- 
SOX2 also destabilize the opposite nucleo- 
somal entry-exit site (SHLs +5.5 to +7), which 
is also evident in the cryo-electron micros- 
copy (cryo-EM) map (Fig. 2 and fig. S3). The 
structures, footprinting profiles (Fig. 3), and 
thermal stability assays (fig. S9C) together 
support the idea that OCT4 and OCT4-SOX2 
release DNA from the histones and have 
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a global effect on the DNA structure of the 
nucleosome. 


OCT4-SOX2 bound at SHL +6 induces minimal 
distortion to nucleosomal DNA 


The SeEN-seq profile suggests that OCT4-SOX2 
engagement depends on motif orientation (Fig. 
1B). To examine this structurally, we utilized 
the same position but with the motif inverted 
across the dyad axis, i.e., SHL +6 (Fig. 4A). 
Doing so places the ~90° kink-inducing SOX2 
motif in a dyad-proximal orientation and OCT4 
closer to the entry-exit site. The SHL +6 site was 
enriched for OCT4-SOX2 binding in SeEN-seq, 
and the use of this position enabled cryo-EM 
structure determination of an OCT4-SOX2- 
NcpS"™"*6 complex at an overall resolution of 
3.5 A (Fig. 4B and fig. S10; see fig. SIOE for 
local resolution). The map allowed unambig- 
uous rigid body docking of the nucleosome 
and of SOX2 (figs. S11 and S12). OCT4 (POUS) 
density was less continuous but sufficient 
to dock a Co model (fig. S11D). The resulting 
OCT4-SOX2 interface was consistent with pre- 
viously determined structures (Fig. 2, fig. SIC, 
and materials and methods). 

In the structure, OCT4 engages its motif 
with the POUS domain only (Fig. 4B), and the 
POUHD is unable to access its motif in the 
observed DNA configuration (fig. S13). OCT4- 
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SOX2 together give rise to an extended DNA- 
binding surface across 11 bp that further bends 
the nucleosomal DNA by ~90° at the SOX2 
site (SHL +5) and straightens the DNA near 
the OCT4 site (SHL +6), producing an L-shaped 
DNA arrangement (Fig. 4B). This conforma- 
tion locally lifts the duplex away from the his- 
tones and DNA gyre, but, in contrast to the 
reversed orientation, does not fully release 
the DNA from histones at the entry-exit site. 
At the SOX2 motif with the DNA locally de- 
tached, SOX2 helices 1 and 2 widen the minor 
groove, and the C terminus (residues 110 to 
114) wedges between the DNA and histones 
(SHL +5) (Fig. 4C). Despite partial engage- 
ment of an internal-nucleosomal motif, the 
SOX2 and OCT4 (POUS) DNA interactions 
and induced DNA distortions are again sim- 
ilar to those previously seen on free DNA (fig. 
S11C). Within one helical turn, on either side of 
OCT4-SOX2, these DNA distortions are largely 
absorbed into the canonical DNA trajectory 
of the nucleosome (Fig. 4D). The histone core 
architecture again shows no substantial dis- 
tortion. The DNA at the opposite nucleosomal 
entry-exit site (SHLs —7 to —5.5) appears to be 
disordered in the cryo-EM map (fig. SIOE and 
fig. S14). In both structures (Figs. 2 and 4), 
OCT4 binds a partial DNA motif through its 
POUS domain and, along with SOX2, affects 
the entire nucleosomal DNA structure to vary- 
ing extents. 


The HMG-POUS partial motif is sufficient 
for TF engagement and the opening of 
chromatin in vivo 


Previous work has identified that either Oct4 
or Sox2 alone engage reduced-complexity mo- 
tifs on nucleosomes during reprogramming 
(7). A recent study has also identified weaker- 
scoring motifs for SOX proteins on nucleosome- 
sized fragments (32). The structures now 
reveal that OCT4-SOX2 partial motif engage- 
ment is utilized in both orientations on the 
nucleosome. This led us to test whether OCT4- 
POUS and SOX2-HMG domains are sufficient 
to engage chromatin in vivo (Fig. 5A and fig. 
S15A). Through in-depth analysis of existing 
chromatin immunoprecipitation sequencing 
(ChIP-seq) datasets (33, 34), we found that 
the partial HMG-POUS motif is sufficient to 
drive genomic binding, although the full motif 
was bound more frequently (Fig. 5A). To test 
this experimentally, we introduced full and 
partial motifs at a defined genomic position 
in mouse embryonic stem cells (mESCs) by 
recombination-mediated cassette exchange 
(35) (Fig. 5B). Motifs were introduced in the 
SHL -6 position of the 601 sequence (see 
materials and methods), and Oct4 binding 
was determined. This revealed significant 
Oct4 enrichment at both the full and par- 
tial motifs but not in the control (Fig. 5B 
and fig. SI5B). Thus, single motifs recapit- 
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ulate genome-wide Oct4 binding to partial 
motifs. 

Next, we asked if binding creates open chro- 
matin. Comparing accessibility in mESCs 
(36) revealed that full and partial motifs can 
both generate accessible chromatin in an Oct4- 
dependent manner (Fig. 5C and fig. S15, C and 
D). Consistently, the same effect is evident 
upon knockdown of Oct4 or Sox2, which shows 
that both TFs are required for full accessi- 
bility at these loci (37) (fig. S15, E to H). This 
confirms the cooperativity observed in SeEN- 
seq; however, the local effect is expected to be 
highly context dependent, as additional pro- 
teins contribute to accessibility in vivo (36). 
We interpret our structures to depict an initial 
encounter complex between OCT4-SOX2 and 
the nucleosome en route to open chromatin. 
Upon nucleosome removal, OCT4 is expected 
to engage a full motif with its two POU do- 
mains, thereby accounting for the stronger 
enrichment of the full versus the partial motif. 
Together, genome-wide binding, single-locus 
insertion, and genome-wide accessibility data 
demonstrate that the OCT4-POUS domain is 
sufficient to engage and open chromatin in 
conjunction with Sox2. This reveals the poten- 
tial for such nucleosome-compatible motifs to 
function as bona fide binding sites beyond the 
ability to initially engage closed chromatin. 


Discussion 


The structures illustrate binding mechanisms 
of OCT4-SOX2 at two positions on the nucleo- 
some. At SHL +6, the structures depict OCT4- 
SOX2 near the entry-exit sites, where both TFs 
cooperate to access DNA. At the SHL -6 site, 
where SOX2 faces the entry-exit site, OCT4- 
SOX2 releases the DNA duplex from the his- 
tones. In the OCT4-SOX2-NCP*""*° structure, 
where SOX2 faces the dyad, the nucleosomal 
DNA assumes an L-shaped trajectory and is 
not fully released from the histones (Fig. 5D). 
The SHL -6 structure demonstrates that par- 
tial motif recognition and DNA release are not 
mutually exclusive (Figs. 2 and 3), whereas the 
SHL +6 structure depicts how more-internally 
bound sites can be accommodated without 
full removal of nucleosomal DNA ends (Fig. 4). 
We consistently find only the OCT4 POUS 
engaged, with the POUHD motif occluded by 
the nucleosome architecture. The OCT4 POUS 
and POUHD domains could, in principal, en- 
gage the full OCT4 motif if the DNA was fur- 
ther unwrapped from the histones, which we 
do not observe in our structures (Figs. 2 and 4) 
or DNasel experiments (Fig. 3C). Thus, partial 
motif recognition allows TFs to minimize DNA 
unwrapping when engaging nucleosomal sites, 
although partial motifs do not fully preempt 
nucleosome distortions. We show that partial 
OCT4 motifs, in conjunction with SOX2, are 
recognized in vivo and create open chroma- 
tin (Fig. 5). 
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SeEN-seq binding profiles, combined with 
the structural data, allow us to rationalize 
OCT4-SOX2 engagement throughout the nu- 
cleosome. OCT4 and OCT4-SOX2 bind best at 
the nucleosomal entry-exit sites, where DNA 
breathing is expected to facilitate access. We 
observe distinct exceptions from such end- 
binding behavior for OCT4 in particular, where 
narrow regions of pronounced binding are 
juxtaposed to nonengaged regions. To corre- 
late these accessibility profiles to the structure, 
we computationally translated isolated OCT4 
POUS or POUHD domains along the DNA of 
the unbound nucleosome and calculated pre- 
dicted atomic clash scores at each position (fig. 
S16). A comparison of OCT4 SeEN-seq data to 
the POU domain nucleosome-clash scores 
revealed that the solvent accessibility of the 
POUS—but not of the POUHD—correlates with 
OCT4 binding (fig. S16, A and B). The solvent 
accessibility for OCT4 POUS is also a good pre- 
dictor for OCT4-SOX2 engagement (fig. S16, C 
and D). The presence of SOX2 enables tight 
OCT4-SOX2 binding at the nucleosome ends 
but has a limited effect on more-internal sites 
(Fig. 1). At nucleosome ends, SOX2 can also 
drive binding at motifs where the OCT4 POUS 
is inward-facing and OCT4 alone does not 
bind (fig. S2H). Our structural and functional 
findings are consistent with a model where 
cooperative binding between OCT4 and SOX2 
not only strengthens DNA binding affinity but 
also triggers additional DNA distortions that 
must be accommodated. These distortions are 
better tolerated at the entry-exit sites, where 
nucleosomal DNA is more loosely bound (38). 
Whereas partial motifs delimit the TF footprint 
and DNA unwrapping, the available structures 
(13) show that protein domains bound to nu- 
cleosomal DNA retain their free DNA-binding 
mode (Figs. 2 and 4). TF-induced DNA distor- 
tions intrinsically destabilize the nucleosome 
(10), which likely facilitates the binding of 
additional factors and disrupts the internu- 
cleosomal interactions of higher-order chro- 
matin (15, 39) (fig. S17). 

The OCT4-SOX2 structures and the accom- 
panying in vitro and in vivo evidence provide a 
framework by which TFs use nucleosomal DNA 
distortion and not histone rearrangement to 
access parts of their motif. The degree of DNA 
distortion imposed on the nucleosome architec- 
ture depends on the position of the motif. Our 
structures suggest principal recognition mech- 
anisms for nucleosome-incompatible TFs as 
well as for those TFs accommodated on the nu- 
cleosome without DNA release, illustrating how 
TFs can read out chromatinized binding sites. 
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Mass-resolved electronic circular dichroism 


lon spectroscopy 


Steven Daly’, Frédéric Rosu’, Valérie Gabelica’* 


DNA and proteins are chiral: Their three-dimensional structures cannot be superimposed with 

their mirror images. Circular dichroism spectroscopy is widely used to characterize chiral compounds, 
but data interpretation is difficult in the case of mixtures. We recorded the electronic circular dichroism 
spectra of DNA helices separated in a mass spectrometer. We studied guanine-rich strands having 
various secondary structures, electrosprayed them as negative ions, irradiated them with an ultraviolet 
nanosecond optical parametric oscillator laser, and measured the difference in electron 
photodetachment efficiency between left and right circularly polarized light. The reconstructed circular 
dichroism ion spectra resembled those of their solution-phase counterparts, thereby allowing us to 
assign the DNA helical topology. The ability to measure circular dichroism directly on biomolecular ions 
expands the capabilities of mass spectrometry for structural analysis. 


wo centuries ago, the interaction of 

polarized light with crystals revealed 

that many molecules come in two forms, 

nonsuperimposable with their mirror 

images (1). The activity and toxicity of 
natural or synthetic molecules often depend 
on their chirality (2). Identifying which mirror 
image is present in a sample is thus crucial. 
Furthermore, because biomolecules such as 
DNA or proteins consist of repetitive chiral 
subunits, they can form helical structures, as 
in the iconic DNA double helix (3, 4). Charac- 
terizing the types of helical structures formed 
by biomolecules is therefore also essential for 
structural biology. 

Mass spectrometry (MS) is a widely used 
analytical method with expanding impact in 
structural biology (5, 6). MS excels at separat- 
ing and quantifying complex mixtures, and 
additional structural characterization can be 
obtained using tandem MS (7), ion mobility 
spectrometry (8), or infrared ion spectroscopy 
(9-11). However, MS-based measurements are 
typically blind to chirality. Characterizing chiral 
compounds by MS currently requires a physical 
interaction with other chiral molecules (72), 
either by separation on chiral phases in front 
of the mass spectrometer or by the formation 
of complexes with chiral auxiliaries in the gas 
phase (73). 

Another way to characterize chiral biomol- 
ecules is through interaction with chiral light. 
In solution, DNA and proteins are conveniently 
characterized by circular dichroism (CD), which 
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measures the difference of absorption between 
left and right circularly polarized light. Elec- 
tronic CD is particularly useful to character- 
ize the different types of helices formed by 
nucleic acids (74) or the secondary structures 
formed by proteins (75). The structural inter- 
pretation relies on spectral databases of known 
structures. However, the interpretation of CD 
spectra becomes difficult for samples wherein 
multiple species or structures coexist, because 
the resulting spectrum is the weighted average 
of all contributions. 

Here, we report a method for recording 
electronic CD spectra on DNA helices sep- 
arated in a mass spectrometer. Although CD 
and MS have been combined before (16-18), 
the previous methods were based on reso- 
nant multiphoton ionization (REMPI) and 
were thus applicable only to volatile neutral 
small molecules (<200 Da). We used elec- 
trospray to produce gas-phase polyanions 
of intact DNA multihelices (>5000 Da) and 
interrogated them with circularly polar- 
ized ultraviolet light directly inside the mass 
spectrometer. 

Our mass analyzer is a quadrupole ion trap 
(Paul trap) with two opposite holes (diameter 
1.7 mm) in the ring electrode. We reasoned 
that this configuration would minimize risks 
of reflection of the laser beam inside the mass 
spectrometer, which could possibly alter the 
circular polarization. The electrospray source 
was operated in the negative mode to produce 
multiply charged anions. We used gentle ion 
transfer conditions to prevent gas-phase re- 
structuring and thus maximize the chances 
of preserving the solution-phase secondary 
structures. 

We analyzed several DNA G-quadruplex 
tetrahelical structures—parallel right-handed 
tetramer TG4T [(dTGGGGT).,°(NH,")3], 
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parallel left-handed intramolecular ZG4 
[dT(GGT),TG(TGG)s3TGTTe(NH,")3] (19), 
antiparallel right-handed intramolecular 5YEY 
[d(GGGTTA),GGGTTTGGG»(K"*).] (20)—and 
the silver-mediated parallel G-duplex GAgG 
[(dGy1)o°(Ag")] (21). These structures were 
chosen because they have remarkably different 
electronic CD spectra in solution in the wave- 
length range 220 to 300 nm (fig. S1). Also, 
these structures are stabilized by spines of 
central cations, which maximizes the likeli- 
hood of preserving the hydrogen-bonding and 
stacking arrangement in the gas phase while 
minimizing rearrangements due to self-solvation 
(21). The preservation of the known solution- 
phase shapes was confirmed by the good agree- 
ment between helium momentum transfer 
collision integrals measured in ion mobility 
spectrometry and those calculated for gas-phase 
structures obtained by molecular dynamics at 
the semi-empirical level (fig. S2). For TG4T, 
infrared ion spectroscopy also supported the 
preservation of G-quartets (22). We can thus 
safely assume that these molecular systems 
have the same base-stacking arrangement in 
the gas phase as in solution. 

When coupled to MS, ion spectroscopy is an 
action spectroscopy (and not an absorption 
spectroscopy): One records the effect of the 
laser irradiation on the ions. Another reason 
for selecting DNA as the first test case is that, 
when irradiating DNA polyanions between 
220 and 300 nm, the resulting action is electron 
photodetachment (ePD) (23, 24). This action 
has two advantages over fragmentation: (i) 
There are few product ions to quantify, which 
in terms of statistics will increase the chances 


energy 
meter shutter 


VA 


4 — OPO laser (230—295 nm) 
> fused silica blade 


Rochon prism 


[DNA]>- 
1/4 waveplate 
[DNA]*- 
mass 1000 1250 1500 1750 2000 
spectrometer miz 


Fig. 1. Experimental setup for generating 
circularly polarized laser pulses. Also shown 

is the typical mass spectrum after isolation 

of D-[(ATGGGGT)4*(NHq*)3]°- ions and irradiation 
with left circularly polarized (LCP) light at 260 nm. 
See supplementary materials for setup details. 
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of detecting differences in product ion yields 
between the two polarizations; and (ii) ePD is 
monophotonic (25), and thus the normaliza- 
tion for fluctuations of the laser power is 
simply linear for all wavelengths. We verified 
this linear relationship, and for all spectros- 
copy experiments we selected the pulse energy 
ranges in which only linear ePD was observed 
(fig. $3). 

Nanosecond laser pulses were generated 
using a wavelength-tunable optical parametric 
oscillator laser (nanosecond pulses, <100 uJ 
transmitted to the trap). To generate circularly 
polarized laser pulses, the beam passes through 
an air-spaced Rochon prism, which gives pure 
linearly polarized light. Next, the laser passes 
through an achromatic broadband quarter- 
wave plate (Fig. 1, figs. S4 to S6, and supple- 
mentary materials). The angle of the fast axis 
of the quarter-wave plate relative to the polari- 
zation direction set by the Rochon prism deter- 
mines the final polarization state of the laser 
pulse. The laser enters in the ion trap through 
a fused silica window in the vacuum mani- 
fold. The percentage of circular polarization 
was greater than 95% (fig. S7 and table S1). 
The ions are randomly oriented inside the 
Paul trap. 

The CD of ions of a given mass-to-charge 
ratio m/z is measured by isolating the ions; 
irradiating them with a single laser pulse of 
the selected wavelength, polarization, and 
pulse energy; and then recording their mass 
spectrum (Fig. 1). Mass spectra and pulse en- 
ergy are averaged for 90 s. The polarization 
state of the laser pulse is then changed by 
rotating the quarter-wave plate, and the mass 
spectrum and pulse energy are acquired for 
another 90 s. This process is repeated 10 times 
for each polarization. The relative electron de- 
tachment yield for each mass spectrum is 
calculated as 


Tepp /Ttotai 


¥; = 
ePD AEcane 


(1) 


where J,pp is the integrated intensity of the 
peak due to electron detachment, Jota) is 
the total integrated intensity (precursor + 
ePD product), 4 is the wavelength, and Eirans 
is the transmitted pulse energy determined 
from the reflected pulse energy by calibra- 
tion (fig. S8). We then calculate the average 
value of the relative electron detachment 
yield for left- and right-handed circular po- 
larizations. The CD monitored by ePD is 
expressed as an asymmetry factor, approxi- 
mated by 


AePD yucp se yRcp 
n= Sy =x HEE) 
ePD Yoop + Yorp 


where LCP and RCP are left and right cir- 
cularly polarized light, respectively. The gas- 
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phase CD spectra are reconstructed by plotting 
the asymmetry factor gepp as a function of the 
wavelength. To facilitate the visual compari- 
son, we plot the solution-phase CD spectra as 
AA /A (where A is the absorbance), hence their 


100% 


L-TG4T 


shape differs from the ones traditionally dis- 
played in molar CD. 

To unambiguously prove that the gas-phase 
CD effect comes from the sample and not from 
an instrumental artifact, we performed the 
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Fig. 2. Gas-phase circular dichroism spectra (AePD/ePD) compared to solution-phase spectra 
(AA/A). Symbols and lines denote gas-phase and solution-phase CD data, respectively. (A) TG4T: natural 
right-handed o-[(TGGGGT)4*(NHa*)3]° (blue) and its mirror image L-[(TGGGGT)4*(NHa*)3]° (red). Inset: 
Gas-phase CD measured at 260 nm for [(TGGGGT)4*(NH,")3]°” ions prepared from 5 uM solutions with 
varying ratios of p-TG4T and L-TGA4T. Error bars show the 95% confidence interval calculated from the 
standard error (three replicas, except at 260 nm with five replicas). (B to D) Gas-phase and solution-phase CD 
spectra for the antiparallel G-quadruplex 5YEY> (B), the left-handed G-quadruplex ZG4° (C), and the G-duplex 
GAgG® (D). The illustrations showing helicity were generated using Chimera from Protein Data Bank 
coordinates (entries 204F, 5YEY, 6GZ6, and 4JRD, respectively); the gas-phase structures are shown in fig. S2. 
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experiment on p-[((TGGGGT),*°(NH,")3] formed 
from the natural DNA backbone (all p-sugars) 
and from its enantiomer (all L-sugars). The gas- 
phase CD signals have opposite signs (Fig. 2A). 
The magnitude is not exactly reversed, however. 
At 260 nm, we recorded a series of CD measure- 
ments by varying the relative concentration 
of the two enantiomers (p:L) in solution from 
only p to only t (Fig. 2A, inset). We obtained 
a straight line (r* = 0.975), which shows that 
the differences in ePD with left and right 
circularly polarized light are reporting on CD; 
this finding suggests that the technique could 
be suitable for quantification. The line did not 
pass exactly through 0 when the proportions 
were 50:50, indicating the presence of re- 
sidual instrumental artifacts, likely because 
the polarization is only ~95% circular. In the 
future, use of an achiral internal standard 
could further reduce the uncertainty. Fur- 
thermore, if such imperfections vary with 
the wavelength, it would induce some dis- 
tortion in the CD spectra that might hamper 
structural assignment. We thus compared 
the solution-phase and gas-phase CD spectra 
for other typical DNA helices (Fig. 2, B to D; 
symbols for gas-phase CD, lines for solution- 
phase CD). The solution- and gas-phase CD 
spectral shapes are similar in terms of sign 
and position of the maxima and minima. 
This suggests that the following conditions 
are met: (i) The base-stacking pattern exist- 
ing in solution is preserved in the gas-phase 
ions. (ii) The gas-phase action reflects the 
absorption, in terms of sensitivity to the cir- 
cular polarization. In other words, the electro- 
nic excited states that are responsible for 
the CD effect also trigger ePD. (iii) Possible 
distortions due to imperfections in the pola- 
rization do not preclude assigning the base- 
stacking arrangement on the basis of spectral 
shape. As a result, the gas-phase CD spectra 
can unambiguously discriminate between 
the different guanine-rich oligonucleotide 
structures. 

The gas- and solution-phase asymmetry fac- 
tors mainly differ in their magnitude, which is 
consistently larger in the gas phase (up to 2%) 
than in solution (<0.4%). This has been ob- 
served previously for small neutral molecules 
measured by REMPI (17, 26-28), where magni- 
tudes of up to 25% have been observed (28). 
High CD signals are also observed in photo- 
electron CD (PECD) (29, 30), but in that case 
the measured property is the angular distribu- 
tion of the photoelectrons. Here, we measure 
the electron detachment yield, and ePD is the 
result of a resonant excitation above the detach- 
ment threshold (24, 25). Given that solution- 
phase spectra result from the CD in absorption, 
we conclude that the gas-phase CD effect in 
photodetachment yields is also due to the re- 
sonant absorption of circularly polarized light 
by a chiral molecule. It is thus puzzling that 
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the CD spectral shapes are similar although 
the magnitudes differ. 

Here we consider possible explanations. In 
small molecules, the solvent can affect the mag- 
nitude of CD (37), and thus a change in the 
dielectric environment from water to vacuum 
may change the magnitude of the CD spectrum. 
To test this hypothesis, we recorded solution- 
phase CD spectra of p-TG4T in mixtures of 
water up to 50% isopropanol (dielectric con- 
stant varying from 80.4 to 44.3). The mass 
spectra show that the G-quadruplex is intact, 
and no larger multimers are present (fig. 
S9). No change in the value of AA/A was ob- 
served (fig. S10), which suggests that the 
dielectric environment plays only a small 
role. We also recorded the gas-phase CD spectra 
for the 5-, 6-, and 7-ions of p-TG4T, reasoning 
that if intramolecular electric dipoles influence 
the electron ejection dynamics, the effect would 
increase with the charge state. We observed 
no significant change in either magnitude or 
shape of the CD spectra as a function of the 
charge state (fig. S11). We thus hypothesize 
that the origin of the larger gas-phase magni- 
tude comes from the different definition of 
asymmetry factors. In the gas phase, absorp- 
tion is revealed by electron detachment. If not 
all states that absorb result in electron detach- 
ment (25, 32) while most states that are re- 
sponsible for CD do, the denominator of the 
asymmetry factor is smaller in gas-phase CD 
action spectroscopy, and thus the asymmetry 


factor is larger. Because the electronic states 
responsible for the CD effect are most likely to 
be delocalized on the entire DNA helices, this 
may result in more efficient autodetachment 
after resonant excitation. Thus, although fu- 
ture work is needed to elucidate the origins 
and dynamics of ePD from DNA polyanions— 
for example, by measuring the photoelec- 
tron CD—the ePD action serendipitously 
revealed itself to be especially well suited to 
probe DNA higher-order structures by ion 
spectroscopy. 

All experiments above were carried out on 
pure samples, and tandem MS was used to 
select a charge state. Next, we used mass sep- 
aration to obtain the CD spectra of indi- 
vidual components from a mixture. Human 
telomeric sequences, consisting of TTAGGG 
repeats, can form several G-quadruplex to- 
pologies (33). At submillimolar potassium 
concentrations, complexes with 1 K* (two 
G-quartets) and 2 K* (three G-quartets) 
coexist (34). The 2-K* complex is fully formed 
at high KCl concentrations, but 1-K* complexes 
cannot be isolated in solution. We previously 
reconstructed solution-phase CD data for in- 
dividual K* binding stoichiometries (Fig. 3A) 
by deconvoluting the solution-phase CD signal 
of a K* concentration series, having determined 
the amount of each complex in separate MS 
measurements. Here we applied mass-resolved 
CD ion spectroscopy directly on the mixture. 
Because the [M+1K]” and [M+2K]’ complexes 


0.004 + 


Fig. 3. Application of mass-resolved circu- 
lar dichroism to a mixture of conforma- 
tions adopted by the human telomeric 
DNA sequence dTAGGG(TTAGGG)3. 

(A) Deconvoluted solution-phase CD spectra 
for the three-quartet (2 K*, orange circles) 
and two-quartet (1 K*, blue squares) 
complexes coexisting at KCl concentrations 


between 0 and 1 mM (34). The error bars are 
the standard uncertainties propagated from 
the original data; the lines were obtained by 
Savitzky-Golay smoothing. The insets show 
the stacking topologies deduced from the K* 


stoichiometry and the CD spectral shapes 


(light gray and colors denote anti and syn 
glycosidic bond angles, respectively). 

(B) Gas-phase CD spectra for the complexes 
+] [M+IK]}> (orange circles) and [M+2K]°~ (blue 
squares) co-isolated from a solution 
containing 400 uM KCI. The error bars are 
the standard uncertainties from the replicates 
(listed in table S2); the lines were obtained by 
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are close in m/z, they could be co-isolated (inset 
of Fig. 3B), and as their ™/z differs, the ePD 
efficiencies of both species could be monitored 
in a single experiment (this is also how we 
envisage using internal standards in the fu- 
ture). Each spectrum was averaged over 180 s. 
Although the magnitudes of gas-phase CD 
signals (Fig. 3B) differ from those of their 
solution-phase counterparts, their position 
and sign allow us to infer the G-quadruplex 
stacking topology using the same rules as in 
solution (35): The 2-K* complex has positive 
CD signals at 290 and 260 nm and a negative 
signal at 240 nm (signature of coexisting 
homo- and heterostacking), whereas the 1-K* 
complex has positive signals at 290 and 240 nm 
and a negative signal at 260 nm (signature of 
heterostacking). 

Our experiments show that it is possible to 
measure the CD of large DNA polyanions in 
the gas phase while exploiting the physical 
separation of components inside the mass 
spectrometer. The similarity between solution- 
phase and gas-phase spectra allowed us to 
distinguish the secondary structures of G-rich 
DNA, thereby extending the scope and capabi- 
lities of structural MS. In the future, proteins 
and their aggregates can be studied by the 
same approach, but overcoming signal-to-noise 
challenges will likely require using internal 
standards and alternative circularly polarized 
light sources giving access to shorter wave- 
lengths. For positive ions and small molecules 
in general, cold mass tagging could provide 
alternative monophotonic action channels that 
may be less dependent on the nature of the 
excited state. Our demonstration of feasibility 


Daly et al., Science 368, 1465-1468 (2020) 


thus opens new avenues to study diverse 
classes of chiral molecules while leveraging the 
separation capabilities of contemporary mass 
spectrometry. 
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SOLAR PHYSICS 


Meridional flow in the Sun’s convection zone is a 
single cell in each hemisphere 


Laurent Gizon'?>*, Robert H. Cameron’, Majid Pourabdian*’, Zhi-Chao Liang’, Damien Fournier’, 


Aaron C. Birch’, Chris $. Hanson? 


The Sun’s magnetic field is generated by subsurface motions of the convecting plasma. The latitude at 
which the magnetic field emerges through the solar surface (as sunspots) drifts toward the equator 
over the course of the 11-year solar cycle. We use helioseismology to infer the meridional flow 

(in the latitudinal and radial directions) over two solar cycles covering 1996-2019. Two data sources 
are used, which agree during their overlap period of 2001-2011. The time-averaged meridional flow is 
shown to be a single cell in each hemisphere, carrying plasma toward the equator at the base of 

the convection zone with a speed of ~4 meters per second at 45° latitude. Our results support the 
flux-transport dynamo model, which explains the drift of sunspot-emergence latitudes through 


the meridional flow. 


eat is transported by convective motions 

of the plasma in the outer 29% of the 

Sun (the solar convection zone). In this 

layer, convection interacts with rotation 

to drive global-scale axisymmetric flows 
(1). The longitudinal component of these flows 
is the solar differential rotation: The equator 
rotates once every 25 days, the poles once every 
34 days. The latitudinal and radial compo- 
nents are the Sun’s meridional flow. The differ- 
ential rotation and the meridional flow both 
play a role in the solar dynamo (2). Differential 
rotation acts on latitudinal magnetic field to 
generate a longitudinal (toroidal) magnetic 
field. At the surface, the meridional flow trans- 
ports the radial magnetic field from the equa- 
tor toward the poles. The role of the deep 
meridional flow is less certain. In the class 
of models known as flux-transport dynamos 
(3), the meridional flow near the base of the 
convection zone is assumed to be equator- 
ward and to transport the toroidal magnetic 
field to match the drift of 2° to 3° per year in 
the latitudes at which sunspots appear (2). 
Thus, these models provide testable predic- 
tions about the amplitude and sign of the deep 
meridional flow. 

Testing these predictions is challenging. The 
geometry of the meridional flow is difficult to 
compute theoretically from first principles, as it 
results from a small imbalance between two 
large terms (4). Mass conservation implies that 
the plasma is carried around closed loops (cells). 
A poleward flow at the surface must return 
equatorward at some depth. There may be 
additional closed cells stacked on top of each 
other in the radial direction. Both one-cell 
and two-cell meridional flow geometries have 
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been proposed on the basis of theory and nu- 
merical simulations (5). Observationally, the 
meridional flow can be constrained using 
helioseismology (6, 7). This technique relies 
on measurements of the times taken by solar 
acoustic waves to travel between points on 
the surface through the interior. Suitable data 
are available for 1996-2011 from the Michelson 
Doppler Imager (MDI) on the Solar and Helio- 
spheric Observatory (SOHO) spacecraft and for 
2010 onward from the Helioseismic and Mag- 
netic Imager (HMI) on the Solar Dynamics 
Observatory (SDO) spacecraft. Studies of the 
HMI data have reached differing conclusions 
on the geometry of the meridional flow: either 
one or two cells in the radial direction (8-11). 
The MDI data and the HMI data also show 
different flow structures (72). These differences 
may result from instrumental systematic er- 
rors, the calibration of the observations, and/or 
different assumptions made during the data 
analysis. 

To confirm the validity of a helioseismic in- 
ference, it is necessary to compare results from 
two independent datasets that have an extended 
overlap period. For example, the Sun’s internal 
rotation has been validated (13) by comparing 
observations from MDI to those of the ground 
stations operated by the Global Oscillation 
Network Group (GONG) (/4). 

Using multiple datasets, we studied the struc- 
ture and time variability of the meridional flow 
in the convection zone. We used maps of the 
line-of-sight velocity at the Sun’s surface (dop- 
plergrams) at reduced spatial resolution, which 
provide information about sound waves prop- 
agating in the solar interior with spherical har- 
monic degrees up to 300. The reduced-resolution 
data are known to be less prone to instru- 
mental errors (15). We considered all three 
main datasets: MDI, HMI, and GONG. The 
MDI observations consist of dopplergrams 
with 192 pixels by 192 pixels per frame for the 
period 1 May 1996 to 11 April 2011 (76). After 
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April 2003, the SOHO spacecraft was rotated 
180° every 3 months; we only used the obser- 
vations when MDI was in the orientation it 
had before 2003 to ensure consistency of the 
data (17). The HMI data (78) that we used cover 
the period from 1 May 2011 to 30 April 2019; 
they were processed by the HMI team to have 
a format (204 pixels by 204 pixels) similar 
to that of the MDI data. We constructed the 
third dataset from the merged and calibrated 
high-resolution GONG dopplergrams (19) for 
1 August 2001 to 30 April 2019 (839 pixels by 
839 pixels) by applying Gaussian smoothing 
and downsampling to 200 pixels by 200 pixels. 

We produced time series of MDI, HMI, and 
GONG images—remapped onto a heliographic 
coordinate system (scale of 0.6° per pixel)—by 
tracking areas on the solar surface at the 
Carrington rotation rate of 456.03 nHz (20). 
The MDI data were corrected to account for a 
misalignment of the instrument with respect 
to the spacecraft, corresponding to a 0.20° error 
in the solar P angle (one of the two angles de- 
scribing the direction of the Sun’s rotation axis 
with respect to the plane of the sky). This error 
was determined using HMI images as reference 
during the 2010 overlap period (27). The orien- 
tation of the HMI images is known to within 
0.015 based on the analysis of the transit of 
Venus from 5 to 6 June 2012 (22). The MDI, 
HMI, and GONG images were all corrected 
for a 0.08° difference between the inclina- 
tion of the solar rotation axis to the ecliptic 
and the value of 7.25° measured by Carring- 
ton in 1863 (20). 

The data were analyzed using time-distance 
helioseismology (23). Solar subsurface flows 
affect the time it takes for acoustic waves to 
travel between two points on the surface, A 
and B. The travel times were measured in 
both directions by cross-correlating the Dop- 
pler velocity data at A and those at B. The 
difference between the two travel times is 
sensitive to flows near the ray path that con- 
nects A and B through the interior. To de- 
termine the meridional flow, we considered 
points separated in latitude, using a quad- 
rant geometry with arcs of 30° [(/2) and fig. 
SI]. Points in locations of strong magnetic 
fields were excluded from the averages (17). 
By fitting a one-parameter model (24) to 
the cross-correlation functions computed 
daily, we measured the south-north travel- 
time differences 1(A, A), where 6° < A < 42° 
is the angular distance between the arcs and 
-54° < i < 54° is the latitude of the midpoint 
between the arcs. We applied a center-to- 
limb correction using travel-time differ- 
ences measured in the east-west direction 
(25) (fig. $2). 

Figure 1, A to C, shows the measured travel 
times averaged over three ranges of travel dis- 
tances at low latitudes in the north and south. 
The travel times are averaged in bins of 3 years 
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starting from 1 May 1996 to aid in the com- 
parison of the three datasets. For the period 
before 1 May 2002, we only show MDI data. 
For the periods thereafter, we included in the 
time averages only those days when travel 
times are available for two instruments: 
MDI (original orientation) and GONG until 
30 April 2011 and HMI and GONG thereafter. 
The signs of the measured travel times are 
consistent with a poleward meridional flow 
near the surface. The time variations of the 
travel times are caused by surface inflows 
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Fig. 1. Averages of the travel-time differences 
caused by the meridional flow. (A to C) The travel 
times were averaged in 3-year periods and over 
latitudes 10° < 4 < 25° (North) and -25° < 1 < -10° 
(South). Between 1 May 2002 and 30 April 2011, only 
the days when travel times for both MDI (original 
orientation) and GONG are available were included in 
the time averages. From 1 May 2011 onward, only the 
days when travel times for both HMI and GONG are 
available were included in the averages. Travel times 
were further averaged over angular distances from 

6° to 18° (A), from 18° to 30° (B), and from 30° to 42° 
(C). The error bars indicate tlc. (D) Monthly sunspot 
number versus time, displaying solar cycles 23 and 24. 
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into magnetic regions (26) and therefore 
are related to the sunspot number (Fig. 1D). 
The MDI and GONG observations during the 
period from 2002 to 2011 differ by ~0.1 s (Fig. 
1, A to C, and fig. S3), which is much smaller 
than the lo uncertainties caused by the ran- 
dom excitation of the acoustic waves. Taking 
noise correlations into account, MDI and GONG 
travel times are consistent for distances <30° 
[(20) and table S1]. However, the HMI travel 
times differ from the GONG travel times, with 
smaller values for HMI times in the northern 
hemisphere. We have been unable to identify 
the source of this discrepancy, so we do not use 
the HMI data for the rest of the analysis. We 
also considered data from the GONG network 
from 1 May 1996 to 31 May 2001, when it op- 
erated with lower-resolution cameras. The cor- 
responding travel times are consistent with 
the MDI data for the same period but noisier 
[(20) and fig. S4]. We therefore combine the 
MDI data from 1 May 1996 to 30 April 2003 
(before the first change in orientation of the 
SOHO spacecraft) and the GONG data from 
1 May 2003 to 30 April 2019. 

The south-north travel-time differences t are 
linearly related to the meridional flow through 
functions of position known as sensitivity kernels. 
We computed the kernels using a finite element 
solver in the frequency domain (27). Denoting 
the colatitude by 8 = 90° — A, we represented 
the radial and colatitudinal components of the 
meridional flow, U, and Us, as linear combi- 
nations of basis functions (20). The coeffi- 
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cients in these expansions are the parameters 
describing the flow, which were determined 
by inverting the travel times. We solved the 
linear inverse problem under the physical 
constraints of mass conservation and that 
the flow does not cross the convection zone 
boundaries (20). Inversions were validated 
using synthetic data (20). Eleven years of data 
allow us to distinguish between one- and 
two-cell flow profiles, with a noise level of 
~1.5 ms‘ at the base of the convection zone. 
For 3 years of data, the noise increases to 
2.5 ms’. These uncertainties are consistent 
with previous estimates (28). 

Figure 2A shows the inferred Up, in the con- 
vection zone, averaged over each solar cycle. 
The stream functions in Fig. 2B show that the 
flow takes the form of a single cell in each 
hemisphere. The flow is poleward at the sur- 
face (Fig. 3C) and equatorward at the base of 
the convection zone (Fig. 3A). The flow profile 
at the base of the convection zone approxi- 
mately follows Ug = U, sin 20, with U, = 4.8 + 
1.0 ms’ for cycle 23 and U, = 3.6+10ms7* 
for cycle 24 (the cycles are identified in Fig. 1D). 
The flow switches sign near 0.79Ro, where Ro 
is the solar radius (Fig. 3B). This is consistent 
with previous inversions that used the constraint 
of mass conservation (9, 11). 

The 3-year averages are shown in Fig. 2, C 
and D, and Fig. 3, D and E. The noise is higher 
than that for the ll-year averages. Except for 
the period between the two cycles, a single 
cell is evident in each hemisphere. At the 
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Fig. 2. Inferred meridional flow in the solar convection zone. (A) Flow Us for solar cycles 23 and 

24 (U, is shown in fig. S5). (B) Stream functions w for each solar cycle, defined by pU = V x [ow/(rsind)], 
where p is the density and 6 is the unit vector in the longitudinal direction. The stream functions are 
normalized to their maximum absolute value. (© and D) Temporal variations with a cadence of 3 years. 
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Fig. 3. Cuts through the solutions of the 
meridional flow. (A) Latitudinal dependence of 
Ug at the base of the convection zone. The 
purple and light blue solid curves show Us for 
solar cycles 23 and 24, respectively, with the 
shaded regions indicating lo. (B) Radial 
dependence of Ug at latitudes of +30° for each 
cycle. (C) Latitudinal dependence of Us at the 
surface for each cycle. (D) Time dependence of 
Up at latitudes of +30° at the base of the 
convection zone. The purple and light blue 
horizontal lines show the averages over 

cycles 23 and 24 (Fig. 1D), respectively, 

with the shaded regions indicating +lo. The 
dark blue and gray lines connect the 

3-year averages from MDI and GONG, 
respectively. Error bars indicate +lo. 

(E) Time dependence of Ug at latitudes 

of +30° at the surface. 
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Fig. 4. Sunspot-emergence locations in relation to the meridional flow. 
(A) Line-of-sight magnetic field contours at the surface (black curves), 
overlain on Ug averaged over the top 2% of the Sun by radius. The flow 
inversion used all travel distances, and variations on time scales faster than 
5 years were filtered out. Inflows into regions where the surface magnetic 
field is strong dominate the time dependence of Us. The dashed curve 
shows where Us, = 0. (B) The same contours of the magnetic field at the 
surface, overlain on Us averaged between the base of the convection 
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zone (0.713Rq) and 0.8Rq. (C) Surface longitudinal (east-west) 
magnetic field B,, from Wilcox Solar Observatory observations (30) and, 
superimposed with black lines, latitudes at which the subsurface longitudinal field from the flux transport model—based on the measured Ug—is maximum. The solid 
lines are the median and the dashed lines are the 16th and 84th percentiles, obtained from 300 realizations of the flow. 


surface, the variations of Us at 30° latitude 
are substantial and anticorrelated with the 
sunspot number (Fig. 3E). Figure 4A shows the 
surface Ug smoothed in time with a low-pass 
filter (5 years) and its relation to the latitudes 
where sunspots emerge. The changes with 
time and latitude can be understood as local 
inflows into magnetic regions, which are 
likely driven by horizontal pressure gra- 
dients caused by the surface magnetic field 
(26). In the middle of the convection zone (fig. 
S6), the meridional flow averaged over cycle 
24 is poleward in the north and very weak in 
the south. The 3-year averages show a decrease 
of the amplitude of U, from about 2004 during 
the decaying phase of solar cycle 23. This de- 
crease is seen in both the MDI and GONG data. 
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The agreement between the MDI (original 
orientation) and GONG data is better for an- 
gular distances of A < 30° than for A > 30° 
(Fig. 1). Travel distances <30° are capable of 
distinguishing between single- and double-cell 
models (12), so we also performed inversions 
for A < 30° only. The results (figs. S7 to S9) 
confirm the single-cell solution for each cycle 
for the MDI and GONG data. Additionally, the 
inversions for MDI and GONG restricted to 
the days in common and for separation dis- 
tances A < 30° are almost indistinguishable 
(fig. S10). 

In flux-transport dynamo models, the amount 
of longitudinal magnetic field created in the 
convection zone is determined by the dis- 
tribution of the radial magnetic field at the 
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Year 


surface (29). The longitudinal magnetic field 
is then transported by Ug near the base of the 
convection zone to produce the drift of lat- 
itudes at which sunspots emerge. To test this 
hypothesis, we used the one-dimensional mean- 
field equation governing the evolution of the 
longitudinal magnetic field (20), with Up from 
each solar cycle averaged from the bottom of 
the convection zone to 0.8Ro. Using a cycle av- 
erage for Up is appropriate because the temporal 
variations of Us deep in the convection zone 
(Fig. 4B) are consistent with noise (Fig. 3D). 
The latitude of the peak subsurface longitudinal 
magnetic field from the model shows an equa- 
torial propagation that is consistent with the 
equatorward drift of locations of strong surface 
field (Fig. 4C). The one-cell meridional flow in 
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each hemisphere that we observed is thus con- 
sistent with the equatorial migration of the 
sunspots under a simple flux-transport model. 
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Surfactant-assisted seeded growth of metal nanoparticles (NPs) can be engineered to produce 
anisotropic gold nanocrystals with high chiroptical activity through the templating effect of chiral 
micelles formed in the presence of dissymmetric cosurfactants. Mixed micelles adsorb on gold nanorods, 
forming quasihelical patterns that direct seeded growth into NPs with pronounced morphological 

and optical handedness. Sharp chiral wrinkles lead to chiral plasmon modes with high dissymmetry 
factors (~0.20). Through variation of the dimensions of chiral wrinkles, the chiroptical properties 

can be tuned within the visible and near-infrared electromagnetic spectrum. The micelle-directed 
mechanism allows extension to other systems, such as the seeded growth of chiral platinum shells 

on gold nanorods. This approach provides a reproducible, simple, and scalable method toward the 


fabrication of NPs with high chiral optical activity. 


hirality is a key signature of nature that 

can be found across length scales, from 

subatomic particles, through molecules 

and biological systems, to galaxies (1-3). 

Imparting handedness to selected mate- 
rials may provide important advantages in terms 
of their interaction with living organisms, 
allowing the development of enantioselective 
catalysts or devices with spin selectivity in 
electron transport (4-6). For example, the 
demonstration of chiral plasmon modes in 
noble metal nanocrystals (NCs) has drawn 
interest in the field of metamaterials and 
the design of enantioselective sensing probes 
(7-9). The fabrication of plasmonic nano- 
structures has thus become an active field of 
research (10-16). However, growing crystal- 
line noble metals with dissymmetric mor- 
phology is challenging (10-12). The formation 
of chiral plasmonic nanomaterials is usually 
achieved by assembling achiral plasmonic 
gold and silver nanoparticles (NPs) with mol- 
ecular templates such as DNA, proteins, or poly- 
meric fibers (13-15). Although lithographic 
approaches have been successfully applied 
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to fabricate chiral gold NPs, their scalability 
remains limited (9, 16). 

Advances in the colloidal synthesis of noble 
metal NPs achieved during the past decades 
suggest realistic prospects for the production 
of plasmonic NCs with diverse morphologies, 
including chiral ones (17-19). The presence of 
chiral amino acids during growth can guide 
the formation of NCs with distinct handed- 
ness. The most prominent example of this 
is the synthesis of helicoidal nanostructures 
assisted by cysteine and glutathione. The en- 
antioselective interaction of amino acids with 
chiral geometrical elements naturally appear- 
ing at particular NC facets, has been claimed 
to induce the observed shape evolution into 
twisted geometries with high dissymmetry 
factors (11, 20). 

In addition to chiral additives, the syn- 
thesis of colloidal chiral NPs requires surface 
ligands that prevent undesired aggregation 
(11, 21). However, most of these capping agents 
also play a role in the growth process and 
the resulting NC morphology (/7, 22). Among 
the wide variety of ligands used for the syn- 
thesis of noble metal NCs, quaternary alkyl- 
ammonium halide (CTAX, where X = Cl or 
Br) surfactants are some of the most exten- 
sively investigated. Through the adsorption 
of CTAX micellar aggregates on certain crys- 
tallographic facets, a wide variety of NP mor- 
phologies have been obtained, including 
nanorods, nanotriangles, and Platonic geo- 
metries (17-19, 22-24). 

Recently, the addition of cosurfactants such 
as aromatic molecules, fatty acids, or long- 
chain alkyl alcohols has been shown to fur- 
ther improve the quality of colloidal gold 
NPs, and of nanorods in particular (25-27). 
The ability of cosurfactants to intercalate 
within CTAX surfactant aggregates has been 
proposed to increase the rigidity of the mi- 
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celles, induce phase transitions, or both. The 
impact of cosurfactants on the aggregation 
behavior of surfactant molecules can also 
be extended to the formation of chiral mi- 
celles. For example, 1,1'-bi(2-naphthol) [BINOL], 
a cosurfactant with axial chirality (i.e., atropiso- 
merism), can induce the formation of chirop- 
tically active, giant, worm-like micelles (28). 

Considering the role of surfactants on the 
synthesis of gold NCs and the possibility of 
producing chiral micelles in the presence of 
BINOL-like cosurfactants, we envisioned the 
surfactant-assisted seeded growth of chiral 
gold NCs. We hypothesized that elongated 
rod-like particles would have the ideal shape 
to guide the adsorption of twisted, worm-like 
micelles on the gold surface into helix-like chi- 
ral structures. To investigate both hypotheses, 
we first performed a computational analysis 
on an ideal system comprising cetyltrimethy] 
ammonium chloride (CTAC) and BINOL. Mo- 
lecular dynamics (MD) simulations revealed 
that BINOL molecules induced the assembly of 
surfactants into chiral, worm-like aggregates. 
Such elongated micelles tend to coil around 
gold nanorods, which can thus be considered 
as templates for the seeded growth of aniso- 
tropic NPs with chiral features. 

Although experimental evidence of the 
proposed idea was obtained using BINOL 
as a cosurfactant, the results of chiral growth 
were largely improved by using its deriva- 
tive 1,1'-binaphthyl-2,2'-diamine (BINAMINE). 
The growth of plasmonic gold nanorods pat- 
terned with a complex chiral surface was 
indeed confirmed by high-angle annular 
dark-field scanning transmission electron 
microscopy (HAADF-STEM) tomography. By 
tuning the dimensions of the final nano- 
structures, high anisotropy factors (g-factor 
~0.20) were achieved within a wide wave- 
length range (from 500 to beyond 1350 nm). 
Theoretical modeling of the optical properties 
highlights the importance of growing well- 
defined chiral wrinkles to obtain intense cir- 
cular dichroism (CD) responses. 

MD simulations confirmed that in the pres- 
ence of BINOL as a cosurfactant, CTAC can 
form giant cylindrical micelles that span 
across the simulation boxes (i.e., up to 60 nm 
in length; Fig. 1A and fig. S1). Regarding their 
chiral nature, we observed that unit vectors 
between the local micellar center and the 
principal micellar axis were distributed heli- 
cally, with the handedness being dictated by 
the choice of cosurfactant stereoisomer. This 
effect can be shown by measuring the average 
angle formed between pairs of vectors (fig. S2). 
An achiral helix does not have any tendency to 
coil into a particular direction, so the angle formed 
between such vectors averages out to zero. 

In our simulations, the average angles not 
only were finite but also appeared sinusoi- 
dally correlated and displayed a well-defined 
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Fig. 1. MD simulations of BINOL-CTAC micelles and their assembly on gold nanorods. 

(A) MD Simulations for (R)-BINOL-CTAC in solution. The aggregation of CTA* (blue) in the presence of 
(R)-BINOL (red) results in chiral, worm-like aggregates spanning the whole simulation box 60 nm in 
length. (B) The shape of the micellar coil is described by a set of vectors along the principal micellar axis. 
The average angle between vectors is plotted as a function of their vertical distance for 60-nm-long 
rod-like micelles. The presence of (R)- or (S)-cosurfactant enantiomers in the micelles leads to coiling 
in opposite directions, as revealed by the sign of the average angle. (C and D) MD simulations for 
(R)-BINOL-CTAC adsorbed onto a gold nanorod in axial and lateral views, respectively. 


handedness (Fig. 1B). Because of the marked 
atropisomerism of BINOL, we can assume that 
it could induce helical chirality onto the sur- 
factant aggregates, likely by a “sergeants and 
soldiers” mechanism in which a chiral molecule 
directs many achiral molecules or atoms (29, 30). 
In this case, the adsorbed chiral micelles would 
assist chirality transfer during further deposition 
of metal atoms onto the nanorod surface. 

We tested this effect by simulating a gold 
nanorod immersed in an aqueous mixed solu- 
tion of CTAC and BINOL. Visual inspection 
of the simulation results revealed that CTAC 
and BINOL adsorbed into worm-like micelles 
coiled around the gold nanorod surface (Fig. 
1, C and D). Although no preferred orienta- 
tion of the assembly was observed by the use 
of different BINOL enantiomers in this case, 
in the absence of cosurfactant, the coil-like 
morphology was largely lost. Instead, small 
aggregates and a high fraction of individual 
surfactant molecules adsorbed on the gold 
surface (fig. S3). The simulations indicated 
that chiral worm-like micelles likely formed 
first and were subsequently adsorbed onto 
the gold nanorod. Thus, a complex cooperative 
chiral transfer mechanism would convey the 
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atropisomerism from BINOL into the giant 
micelles and ultimately to gold nanorods. 

On the basis of the simulation results, we 
implemented experimentally the growth of 
chiral nanorods. (R)-BINOL was solubilized 
by surfactant micelles in aqueous solution 
at an optimum molar ratio of 40 and used to 
template the growth of preformed gold nano- 
rods (130 + 10 nm long, 29 + 3 nm thick; for 
details, see Fig. 2A and the supplementary 
materials; note that (R)-BINOL is insoluble in 
water in the absence of surfactant). Prelim- 
inary characterization by CD spectroscopy 
revealed the appearance of moderate, chiral- 
localized surface plasmon resonance bands 
(g-factor ~0.002; figs. S4 and S5). HAADF- 
STEM analysis of the NPs grown in the (R)- 
BINOL and surfactant mixture revealed a 
complex morphology characterized by the 
presence of wrinkles on the nanorod surface 
(Fig. 2B and fig. S4). Previous theoretical and 
experimental studies demonstrated that in- 
tense CD responses are exhibited by NPs with 
distinct chiral distortions, stellated-like mor- 
phologies, or both (9, 11, 31). 

Encouraged by the appearance of plasmonic 
bands in the CD spectra, our efforts focused on 
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improving chiral seeded growth. We hypothe- 
sized that the use of stronger stabilizing agents 
would help to direct the growth of more struc- 
tured chiral wrinkles (32). We investigated 
(R)-BINAMINE, an analog of (R)-BINOL in 
which the hydroxyl functionalities are replaced 
by amine moieties. Although the interaction of 
amines with gold surfaces is moderately strong 
(Au-N bond energy ~6 kcal/mol), the presence 
of two amines in a chelating configuration 
should ensure a greater affinity (33, 34). MD 
simulations of the BINAMINE-CTAC system 
confirmed the formation of chiral, worm-like 
aggregates, similar to those for BINOL-CTAC, 
with a helical distribution of unit vectors (fig. 
S6). HAADF-STEM analysis of the particles 
obtained by seeded growth in (R)-BINAMINE- 
CTAC revealed the presence of gold nanorods 
displaying a highly complex surface mor- 
phology resembling an intricate network of 
wrinkles (Fig. 2C and figs. S7 to S10). Scanning 
electron microscopy (SEM) imaging of the 
same NPs also revealed a complex surface struc- 
ture (fig. S11). 

Notwithstanding, proper understanding 
of such complex structures demands a more 
detailed characterization than that provided 
by either HAADF-STEM or SEM, which only 
retrieve two-dimensional (2D) and surface 
information, respectively. We used HAADF- 
STEM tomography, which has been shown to 
be a powerful technique to investigate the 3D 
morphology of complex NCs. We prepared sim- 
ilar particles, but with increasing dimensions, 
using 130- x 29-nm gold nanorods as seeds and 
characterized them with HAADF-STEM (see 
the supplementary materials for synthetic de- 
tails). Electron tomography reconstructions of 
nanorods with the smallest degree of growth, 
i.e., 165 nm long and 73 nm thick, revealed the 
presence of sharp wrinkles oriented in a seem- 
ingly radial direction and displaying tilt angles 
between 0° and 45° with respect to the short 
axis (Fig. 2D). Detailed analysis of the internal 
structure suggested that the wrinkles grew 
from the surface of the gold nanorod seed in a 
quasiradial direction up to ~15 to 20 nm in 
height and with a constant width of 3 to 4nm. 
The average intergroove separation was mea- 
sured to be 2 to 3 nm (Fig. 2E), which is near the 
3.75-nm width of worm-like micelles obtained 
from MD simulations. This finding would sug- 
gest a micelle-directed chiral growth of the 
gold nanorods, as we initially hypothesized. 

The width of the wrinkles was compatible 
with the diffusion of metal ion-loaded CTAC 
micelles from solution (35), leading to reduction 
at available gold sites between (R)-BINAMINE- 
surfactant micelles on the nanorod surface. 
By varying the concentration of nanorod 
seeds in the growth mixture, chiral nanorods 
were obtained with increased dimensions of 
210 nm in length and 112 nm in thickness, and 
their surface was more entangled (Fig. 2F). 
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Nonetheless, the presence of tilted wrinkles 
was still visible and angles between 0° and 
45° were again observed [between 0° and 
-45° in the case of (S)-BINAMINE; fig. S9]. 
Analysis of the internal structure revealed 
wrinkles of ~35 to 45 nm in height and a 
similar width to those in the smaller rods 


Fig. 2. Growth of chiral gold nanorods in micellar systems and effects of 


size and shape. (A) TEM 


the presence of (R)-BINA 


Fig. 3. Qualitative analysis of chiral features. (A) Tomog- 


raphy reconstruction of a gold nanorod grown in an 
(R)-BINAMINE-surfactant mixture (see also movie S7). 


(B) Low-resolution HAADF-STEM image of chiral gold nanorods 
obtained in the presence of (R)-BINAMINE. Scale bar, 100 nm. 


(C) 3D FFT from which the inverse FFT was computed a 
segmentation (see also movie S10). (D) Inverse FFT (see 


also movie S12) showing the areas of the particle with chiral 


features overlapped with the projection of the tomograph 


indicating a right-handed angle. (E) Tomography reconstruction 


in HAADF-STEM (left) and EDX (right, Pt displayed in gre 


modes of an Au@Pt nanorod grown in an (R)-BINOL-surfactant 


mixture (see also movies S24 and S25). (F) HAADF-STE| 


image at low magnification of Au@Pt nanorods. Scale bar, 
100 nm. (G) 3D FFT from which the inverse FFT was computed 


after segmentation (see also movies S28 and S29). 

(H) Inverse FFT (see also movie S30) showing the areas 
the particle with chiral features overlapped with the proje 
of the tomography, indicating a right-handed lower angle. 
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image of gold nanorods 130 nm in length and 29 in 
width used as seeds. (B) High-magnification HAADF-STEM image of a chiral gold 
nanorod grown in (R)-BINOL displaying a complex surface containing wrinkles. 
(C) HAADF-STEM image at low magnification of gold nanorods obtained in 

INE displaying a complex surface containing sharp 
wrinkles. Scale bars, 200 nm [(A) and (C)] and 100 nm (B). (D to I) Gold 
nanorods of 165 x 73 nm [(D) and (E)], 210 x 112 nm [(F) and (G)], and 270 x 
175 nm [(H) and (1)] grown in (R)-BINAMINE-surfactant mixtures were analyzed 


(Fig. 2G). Finally, even larger chiral nanorods 
(270 nm long and 175 nm thick) were in- 
vestigated. Although the surface appeared 
more undefined in this case (Fig. 2H), anal- 
ysis of the internal structure showed the 
characteristic features described for the pre- 
vious samples (Fig. 21). For completeness, 3D 
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animations of all discussed systems are pro- 
vided as supplementary materials (movies 
S1 to S48). 

Diffraction of ideal helical structures results 
in X-shaped patterns (36, 37). Thus, we applied 
fast Fourier transformations (FFTs) to our 3D 
reconstructions. We exemplify this analysis 


Von 


SARL. NP 


es 


=~ is 


by HAADF-STEM. Scale bars, 20 nm [(D) and (E)], 50 nm [(F) to (l)], and 

10 nm [(J) and (K)]. Tomography reconstructions [(D), (F), and (H)] reveal 
their surface topography, and selected orthoslices show the growth of 

wrinkles from the gold nanorod seeds and the internal structure of the wrinkle 
network [(E), (G) and (l)]. (J and K) Tomography reconstruction and selected 
orthoslice of an NP obtained by overgrowth of a 30-nm gold nanosphere in 

an (R)-BINAMINE-surfactant mixture. Scale bar, 100 nm. Corresponding 
animated reconstructions are provided in movies S1 (D), S3 (E), S7 (F), S9 (G), 
$18 (H), S20 (I), S43 (J), and S45 (Kk). 
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with chiral nanorods of 210 x 112 nm (Fig. 3, A 
and B), from which the 3D FFT indeed shows 
an X-shaped pattern (Fig. 3C); further details, 
as well as comparison with an idealized model 
(fig. S12), with a smooth nanorod (fig. S13), and 
with a (R)-BINOL-grown nanorod (fig. S13), 
are provided in the supplementary materials. 
Next, the spots in reciprocal space can be linked 
to the corresponding features in real space by 
manually segmenting the 3D FFT to minimize 
noise. By using the segmented 3D FFT as a mask 
(see details in the supplementary materials), an 
inverse FFT (38) was computed and overlayed 
(pink fringes) with the original reconstruction, 
as exemplified in Fig. 3D, so that the helical 
features are visually highlighted. Although the 
use of a 3D FFT does not provide a quantita- 
tive value such as, e.g., the Hausdorff chirality 
measure (39), it provides a qualitative and visual 
way to estimate the presence of chiral fea- 
tures. The inverse FFT enables direct location 
of specific periodicities at a local level. 

Fringes with a right-handed angle can be 
seen in Fig. 3D, and the wrinkles growing on 
the sides of the gold nanorod seeds seemed 
to display a better-defined chiral arrangement 
than those located at the hemispherical tips. 
The observation of a curvature-dependent 
growth was confirmed by a control experi- 
ment, in which seeded growth was performed 
on 30-nm gold nanospheres (figs. S14 and S15). 
Although some chiral features could still be 
observed, the spheres yielded a more ran- 
dom wrinkle organization, as revealed by the 
corresponding HAADF-STEM tomography 
reconstruction (Fig. 2, J and K), and further 
supported by their 3D FFT and correspond- 
ing inverse FFT (fig. S14). Accordingly, more 
intense plasmonic CD bands were recorded 
when gold nanorods were used as seeds 
(g-factor ~0.2; Fig. 4A) compared with spheres 
(g-factor ~0.003; fig. S15). 

Additional evidence behind the general val- 
idity of this chiral growth method and the 
proposed mechanism was provided by exper- 
iments in which we varied the nature of the 
metal deposited during seeded growth. We 
thus implemented the seeded growth of Pt on 
(R)-BINOL-CTAC-covered gold nanorods using 
identical dimensions and similar growth con- 
ditions. Representative results in Fig. 3, E to H, 
show that seeded growth resulted in very reg- 
ular wrinkled platinum coating. The smaller 
dimensions of the wrinkles, as well as the dif- 
ferent electron configuration between plati- 
num and gold (see a 3D EDX reconstruction 
in Fig. 3E, right), allow a better distinction 
of the obtained pattern. Although the tilting 
angle is obviously lower than that measured 
for gold, a helical character is demonstrated 
by the 3D-FFT analysis (Fig. 3, G and H). 
Unfortunately, the lossy character of platinum 
does not allow recording of meaningful plas- 
monic optical activity for these samples. 
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Fig. 4. Effect of nanorod dimensions on chiral 
plasmonic activity. (A) Spectral evolution of the 
anisotropy factor for chiral Au nanorods with 
increasing particle size: 165 x 73 nm (red), 210 x 
112 nm (blue), and 270 x 175 nm (magenta). 

(B) Models of chiral gold nanorods used to simulate 
the chiral plasmonic properties (from left to right: 
165 x 73, 210 x 112, and 270 x 175 nm). 

(C) Calculated anisotropy factor spectra [from 

left to right in (C): red, blue, and magenta]. 


We next demonstrate that the optical ac- 
tivity (CD bands) can be modulated by varying 
the dimensions of chiral nanorods (by varying 
the amount of seeds in the growth solution). 
In all cases, an intense negative Cotton effect 
was recorded for the (R)-enantiomer (29). As 
the size of the nanorods was increased from 
165 x 73 up to 270 x 175 nm, the positive band 
redshifted from 700 up to 1300 nm, and the 
maximum of the negative band shifted from 
1100 up to beyond 1350 nm (fig. S16). Analysis 
of the anisotropy factor showed values ranging 
from 0.1 to 0.2 (Fig. 4A), which are among the 
highest values reported for colloidal plasmonic 
NPs in the visible and near-infrared (NIR) 
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ranges (10, 11). By extending the analysis fur- 
ther into the NIR, up to the limit of our CD 
spectrometer (1600 nm), we recorded even 
higher dissymmetry factors of 0.25 to 0.28 
(fig. S17). However, the noise in the 1400- to 
1500-nm range precluded us from obtaining 
more compelling evidence to confirm the pre- 
cise values. 

When gold nanorods of different dimen- 
sions and degrees of anisotropy were used to 
seed the chiral growth (figs. S18 and S19), sim- 
ilar trends in optical activity were observed. 
However, the anisotropy factor was influ- 
enced by the nanorod dimensions. In general, 
slightly lower g-factor values were obtained 
when shorter and thinner gold nanorods were 
used as seeds (100 x 12 nm; i.e., higher aspect 
ratio), whereas a decrease was observed for 
seeds with similar length to the optimal one 
but thicker in diameter (135 x 52 nm; i.e., 
lower aspect ratio). 

The relation between the chiral features 
observed by electron microscopy and the 
chiroptical activity measured by CD spectros- 
copy was further investigated by electromag- 
netic modeling. We used an accurate full-wave 
solver based on Maxwell’s surface integral equa- 
tions and the method of moments to model the 
plasmonic properties of chiral gold nanorods 
(see the supplementary materials for details) 
(40-42). Three-dimensional computer-aided 
design (CAD) models were constructed on the 
basis of the information provided by the ex- 
perimental electron tomography reconstruc- 
tions (Fig. 4B). Although suitable reproduction 
of the observed morphologies appeared chal- 
lenging, given the intricate network of wrinkles 
covering the chiral gold nanorod surface, we 
created models for the three different dimen- 
sions characterized by electron microscopy: 
165 x 73, 210 x 112, and 270 x 175 nm. To mimic 
the regions with wrinkles displaying differ- 
ent angles, each rod was built with 16 helices 
having four leveled and four inclined steps per 
pitch (pitch of 90 nm). The wrinkle width was 
fixed at 3.5 nm and the separation distance 
between wrinkles was 2.5 nm. 

The simulations revealed chiral plasmonic 
bands that shifted toward longer wavelengths 
with increasing nanorod dimensions, in agree- 
ment with the experimental results (Fig. 4C 
and figs. S20 to $22). Differences in the an- 
isotropy factors can be attributed to the poly- 
dispersity in size and shape of the real sample, 
as well as the difficulty in modeling the exact 
morphology of the NPs. Overall, our theo- 
retical model confirmed the origin of the plas- 
monic chiroptical activity and the importance 
of sharp wrinkles for the emergence of strong 
CD responses. Replacement of BINAMINE by 
L-cysteine did not induce any clear modifica- 
tion on the CD signal obtained for chiral nano- 
rods synthesized with either (R)-BINAMINE 
or (S)-BINAMINE. A substantial modification 
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would be expected if the chiroptical properties 
stemmed from the coupling of the chiral mole- 
cules with the NP plasmon rather than from 
the chiral structural features (fig. S23) (43). 

We have experimentally demonstrated that 
chiral gold nanostructures can be readily ob- 
tained by seeded growth of gold nanorods in 
BINOL-surfactant mixtures, whereas NP col- 
loids with intense CD responses were obtained 
using BINAMINE-surfactant mixed micelles. 
The chiroptical properties stem from the forma- 
tion and stabilization of steep chiral wrinkles. 
These findings point to a dual role of mixed 
micelles: (i) templating the growth of steep 
wrinkles and (ii) the subsequent stabilization 
of such morphological features. 

We propose that the formation of wrinkles 
during seeded growth can be explained by the 
presence of elongated micelles coiled on the 
gold nanorod seeds, which act as patterns di- 
recting the diffusion of micellar aggregates 
containing gold ions, from the aqueous so- 
lution toward the core NC at intermicellar 
regions. Indeed, the size of micelles in solu- 
tion would fit the interwrinkle distance on 
the nanorod surface. However, wrinkle for- 
mation is only induced when the rate of gold 
ion deposition is faster than their diffusion 
on the surface (44). We observed that upon 
lowering the concentration of reducing agent 
(ascorbic acid, from 160 down to 1.6 mM), the 
CD intensity decreased by an order of mag- 
nitude and less-defined surface roughness was 
observed in HAADF-STEM (fig. S24). 

An additional source of surface stabiliza- 
tion is required to preserve the wrinkles after 
growth. In this case, BINAMINE plays such a 
role during the growth process, likely because 
of its two amine moieties in chelating confor- 
mation (33). When one of the amine groups in 
BINAMINE was replaced by a hydroxyl moi- 
ety, the passivation effect was suppressed and 
the growth of chiral features was hindered, 
which resulted in a decrease of CD intensity 
(fig. S25). Once the growth is completed, ad- 
dition of a stronger stabilizing agent (e.g., 
cysteine) is necessary because the adsorp- 
tion of surfactant micelles is not sufficiently 
strong and the NPs may slowly reshape, as 
reflected by the loss of chiroptical properties 
(fig. S26). 

The approach described here for the syn- 
thesis of chiral plasmonic anisotropic NCs is 
simple, reproducible, and holds great poten- 
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tial for the large-scale production intrinsic to 
colloid chemistry methods. Compared with 
the growth of inorganic chiral nanostruc- 
tures based on the direct chemisorption of 
small additives on the NC surface to induce 
chiral growth, the micelle-directed growth 
method relies on the ability of the chiral co- 
surfactant to direct the formation of helical 
micelles. Such supramolecular assemblies 
have sufficient interaction points with the NPs 
to effectively transfer their chirality to them 
during the growth step through a multiva- 
lency effect. 
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A multiplanet system of super-Earths orbiting 
the brightest red dwarf star GJ 887 
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The closet exoplanets to the Sun provide opportunities for detailed characterization of planets 

outside the Solar System. We report the discovery, using radial velocity measurements, of a compact 
multiplanet system of super-Earth exoplanets orbiting the nearby red dwarf star GJ 887. The two planets 
have orbital periods of 9.3 and 21.8 days. Assuming an Earth-like albedo, the equilibrium temperature of 
the 21.8-day planet is ~350 kelvin. The planets are interior to, but close to the inner edge of, the 
liquid-water habitable zone. We also detect an unconfirmed signal with a period of ~50 days, which could 
correspond to a third super-Earth in a more temperate orbit. Our observations show that GJ 887 has 
photometric variability below 500 parts per million, which is unusually quiet for a red dwarf. 


t visible wavelengths, GJ 887 (HD 217987) 
is the brightest red dwarf in the sky and, 
at a distance of 3.29 pc, is the 12th closest 
star system to the Sun. GJ 887 is the most 
massive red dwarf within 6 pc of the 
Sun, which is close enough for a direct stellar 
radius measurement using interferometry (). 
GJ 887s stellar parameters are listed in Table 1. 
Red dwarfs are amenable to radial velocity 
(RV) searches for temperate Earth-mass exo- 
planets: their low luminosity means that tem- 
perate planets have short orbital periods, and 
their low stellar mass implies that Earth-mass 
planets can impart a reflex RV detectable with 
current instrumentation. Although the transit 
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method of planet discovery efficiently detects 
planets because many stars can be simulta- 
neously monitored, it will detect only planets 
that pass through the line of sight between 
Earth and the host star. Consequently, only 
1 to 2% of habitable-zone planets (i.e., those 
with surfaces that can support liquid water) 
are detectable with the transit method. The 
RV method is necessary to achieve a complete 
census of the planets orbiting our closest stel- 
lar neighbors, especially red dwarfs. 

We monitored GJ 887 as part of the Red 
Dots #2 project. Nightly observations were 
taken with the High Accuracy Radial Velocity 
Planet Searcher (HARPS) (2) for 3 months. 
We also obtained contemporaneous photomet- 
ric observations (3). Regular nightly sampling 
combined with photometric observations mit- 
igates against false-positive exoplanet detec- 
tions caused by intrinsic stellar variability and 
other sources of correlated noise. We supple- 
ment our data with >200 archival observations, 
spanning nearly 20 years (3), from HARPS, the 
Planet Finder Spectrograph (PFS) (4), the High- 
Resolution Echelle Spectrometer (HIRES) (5), 
and the University College London Echelle 
Spectrograph (UCLES) (6). We used photom- 
etry from various ground-based observatories 
and the Transiting Exoplanet Survey Satellite 
(TESS) spacecraft (7). Tables S3 and S4 list all 
the data we used. 

We searched for a candidate planet by ad- 
ding a (circular) Keplerian orbit test signal to a 
base model and measuring the improvement 
in the logarithm of the likelihood statistic (3). 
The base model is composed of an offset and an 
instrumental jitter that are added to the measure- 
ment uncertainties for each dataset. We used this 
base model to generate log-likelihood periodograms 
for both the RV and photometric data and then 
searched for signals by plotting the increase in 
the log-likelihood statistic against test period 
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(Fig. 1). The highest peaks were evaluated for 
statistical significance (8, 9). We recursively 
added additional planet test signals, adjusting 
all the parameters to maximize the likelihood 
of all planet signals and the parameters of 
the base model. We continued this iterative 
process until no signals below a threshold of 
0.1% false-alarm probability were found in 
the time series (fig. S1). We detected periodic 
signals at 9.3, 21.8, and 50.7 days (Fig. 1B) and 
verified them using several independent fit- 
ting procedures and algorithm implementa- 
tions (3). The regular sampling of the Red Dots 
#2 dataset helped to disentangle the signals 
under investigation (Fig. 1A). 

Stellar magnetic activity can induce an asym- 
metric distortion of the spectral lines, shifting 
the measured line center and consequently in- 
ducing an apparent RV shift, which may appear 
as a false-positive exoplanet at the stellar rota- 
tion period (J0). The rotation period of GJ 887 
is unknown, so we searched for periodicities 
in the photometric data (3). The archival data 
from 2002-2004 show an ~200-day period, 
but this period was undetectable in the 2018 
quasi-simultaneous photometric observations 
because the time span is too short. Our analy- 
sis of the TESS photometry shows very low 
intrinsic variability, with a semi-amplitude of 
240 parts per million. It is unclear whether 
the low photometric variability of the TESS 
photometry is caused by systematic effects, 
known to affect the TESS observations, but 
we use this value as an upper limit to the in- 
trinsic variability of GJ 887. The TESS variability 
can be explained by one starspot, or a group of 
starspots, with a total diameter of 0.3% of the 
stellar surface, indicating that GJ 887 is slowly 
rotating with very few surface brightness in- 
homogeneities (J7). GJ 887 is less magnetically 
active than most stars with the same effective 
temperature, as demonstrated by: the very low 
starspot coverage; low photometric variability; 
the activity metric derived from stellar Ca 1 H 
and K lines, log(R'ux) = —4.805 (72); and the 
very low Ha activity (13). 

The RV signals are detected in the Red 
Dots # 2 HARPS data alone (Fig. 1A), so we 
investigated additional spectral signatures of 
stellar magnetic activity using this dataset. We 
extracted a time series of the flux in the cores 
of the Na1 D, Ha, and Hf lines and calculated 
the S-index, which is the ratio of flux in the 
cores in the Cam H and K lines compared with 
the continuum (3). The S-index and Na D lines 
both show a weak signal at about 55 days, 
whereas the Ha and Hf lines show a weak sig- 
nal at 38 days (fig. S5). These differing period- 
icities could reflect time scales of various 
stellar activity processes on the star, and de- 
spite being low in amplitude, they make a 
planetary origin for RV signals in the 30-to- 
60-day domain less certain. None of these activ- 
ity periodicities are close to the RV signals at 
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9.3 and 21.8 days, but they do make the RV 
signal at 50.7 days questionable. 

Correlated noise—for example, that caused 
by stellar activity—can be assessed using the 
covariances between observations. To further 
test the planetary origin of the detected RV 
signals, we fitted maximum likelihood model 
functions using two planet models with and 


Fig. 1. Periodograms of the RV data. 
(A) The log-likelihood (Aln L) period- 
ogram obtained separately for all RV data 
before 2018 (brown) and the Red Dots 
#2 campaign (red). (B) The same search 
for a first signal when combining all the 
RV observations together. The vertical 
green lines indicate our derived model 
periods for GJ 887 b and GJ 887 c and the 
third signal. The horizontal dashed lines in 
both panels indicate the false alarm 
probability (FAP) values. 


Fig. 2. Time series of radial 


without Gaussian processes (GPs) (3). All of 
the models including GPs improved the fit to 
the data compared with those without, and the 
amplitude of the signals with periods of 9.3 
and 21.8 days remained unchanged within their 
1o uncertainty. The modeling of the correlated 
noise using GPs therefore does not affect these 
two signals. However, the significance of the 


9.26 days 


21.79 days 


third signal drops substantially when includ- 
ing a GP in the model, casting further doubts 
on its Keplerian nature. Table S4 lists the de- 
rived values and statistics from these models. 

We conclude that the two signals with or- 
bital periods of 9.3 and 21.8 days correspond to 
two exoplanets, GJ 887 b and GJ 887 c, respectively. 
The minimum masses of these planets (77, sin?) 
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measurements over 18 years using 
all instruments. The best-fitting 
model with three Keplerian signals 
is shown with a solid blue line. 

(B) Zoom in on area indicated with 
arrow in (A), showing the Red Dots 
#2 observations. The three-planet 
model is shown with a solid blue 
line, and the two-planet model with 
a solid orange line. It is uncertain 
whether the ~50-day signal has a 0 
planetary origin, but three periodic 
modulations are required to fit the 
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are 4.2 + 0.6 and 7.6 + 1.2 Earth masses (MV), 
which makes them two super-Earth exopla- 
nets with orbital semimajor axes (ap) of 0.068 
and 0.120 astronomical units (au). The inner 
planet has an orbital eccentricity consistent 
with zero, but the outer planet is more likely 
to have a small nonzero eccentricity (Fig. 2). 
We regard the third signal at ~50 days as 
dubious and likely related to stellar activity. 
The fits to our two-planet model and the two- 
planet + third signal model are shown in 
Fig. 2. 

The long-term dynamical stability of the or- 
bits can also be used to test the physical plau- 
sibility of the system and investigate whether 
it contains unusual configurations such as dy- 
namical resonances. We performed a dynamical 
stability study (3) using the software package 
MERCURY6 (J4). We find that all two-planet 
solutions are stable even if eccentricities are 
allowed to vary. The ratio of periods of these 
two planets is close to 7:3, but the simulations 
do not support the presence of a dynamical 
resonance, as there is an absence of oscillating 
orbital alignment variations (75). However, we 
find that the system must be in a dynamically 
active state, driving oscillatory changes in the 
eccentricities of both planets. These interactions 
produce very regular variations, supporting 
the hypothesis that the two-planet configu- 
ration is dynamically stable on very long time 
scales. For a putative system with three planets, 
only ~25% of the 1000 best-fitting models would 
be dynamically stable over 10° years, but this 


is mostly due to the poorly constrained ec- 
centricities. Given that the observations only 
provide upper limits on the eccentricities, we 
investigated orbits that are assumed to be 
initially circular (initial zero eccentricities). Even 
in this three-planet case, >99% of configura- 
tions were found to be stable, meaning that 
the presence of a third planet cannot be ruled 
out using dynamic stability considerations. 

The separations between the planets, in 
units of their spheres of gravitational influence 
(Hill radii), are ~19.1 for GJ 887 b and GJ 887 c 
and ~17.2 for GJ 887 c and GJ 887 d (if GJ 887d 
is real and has a mass of 8.3 Ma); these values 
are consistent with the system having under- 
gone dynamical relaxation (16). Dynamical relax- 
ation in systems of super-Earths results in 
~80% of planets having orbital eccentricities 
€p < 0.1, with the remaining 20% having ep < 
0.3 (17). We examined the expected tidal evolu- 
tion of GJ 887 b using analytical methods 
(8, 19), finding that the tidal circularization 
time scale of GJ 887 b is a few billion years for 
an assumed tidal dissipation parameter Q> = 
1000. The tidal evolution is consistent with our 
observation that GJ 887 b’s orbit is almost 
circular. 

The multiplanet super-Earth system around 
GJ 887 is consistent with recent planet for- 
mation models (20, 21). These models typi- 
cally form chains of multiple planets trapped 
in mean-motion resonances that then migrate 
into orbits close to the central star. Depending 
on where the initial planets formed in the 


protoplanetary disc, they could have accreted 
either large amounts of water ice or just dry 
rocky silicates. As such, the planets may be 
either water-rich or water-poor. At the end of 
the gas disc lifetime, the resonant chains of 
planets can remain stable, yielding systems 
similar to the seven-planet TRAPPIST-1 plan- 
etary system (22), or they can become unstable, 
leading to collisions between planets and thus 
a nonresonant configuration (20). The GJ 887 
planetary system appears to be more consist- 
ent with the latter scenario of unstable evolu- 
tion. The presence of dynamical resonances 
can be very sensitive to the existence or ab- 
sence of additional planets. Consequently, if 
the third signal at 50.7 days is real or if addi- 
tional planets exist, this may result in a more 
resonant system. 

We used standard methods (23) to calculate 
the distances from GJ 887 within which planets 
could support liquid water on their surfaces 
[the star’s habitable zone (HZ)] and found 
that it extends from ~0.19 to 0.38 au. With an 
Qy Of 0.120 + 0.004, GJ 887 c is closer to its host 
star than the HZ but near the inner edge. If the 
~50-day signal is planetary in origin, it cor- 
responds to a super-Earth in GJ 887’s HZ. As- 
suming an albedo, oa, similar to Earth’s (a = 
0.3), the equilibrium temperature, 7.4, of GJ 887 
b and GJ 887 c would be 468 and 352 K, re- 
spectively. Their incident energy fluxes from 
the star (the insolation S) are 7.95 and 2.56 times 
the Sun’s insolation on Earth. Figure 3 shows the 
insolation of known planets orbiting M-type 


Table 1. Stellar parameters for GJ 887 and parameters for GJ 887 b and GJ 887 c. The top half of the table lists parameters for GJ 887: the parallax in milliareseconds 
(mas), distance in parsecs (pc), V- and G-band magnitudes, stellar mass in solar masses (M.), metallicity relative to the Sun [Fe/H], luminosity and radius in solar units, projected 
rotational velocity (vsini), and surface gravity log g. The stellar mass was computed using a mass-radius relation (26). The bottom half of the table lists parameters for GJ 887 b and GJ 887 c: 
K, is this amplitude, Pp is the period, Ser is the incident flux from GJ 887 in units of the incident flux on Earth from the Sun, and Teg is the equilibrium temperature of the planet. 
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Fig. 3. The incident flux (insolation) of planets orbiting M dwarf stars. The dashed vertical lines delimit 
the habitable zone around GJ 887 for the maximum greenhouse planetary atmosphere (left) and the runaway 
greenhouse planetary atmosphere (right) (23). The solid vertical gray lines indicate the range of limits for the 
host stars of all planets plotted; these stars have Te ranging from 2400 to 4150 K (see color bar). GJ 887 b 
and GJ 887 c are indicated by the large red pentagons. Proxima Cen, Proxima Centauri; d, distance. 


red dwarfs as a function of host star apparent 
magnitude. GJ 887 has the brightest apparent 
magnitude of any known M dwarf planet host. 
This brightness, combined with the high photo- 
metric stability of GJ 887, exhibited in the TESS 
data, and the high planet-star brightness and 
radius ratios, make these planets potential tar- 
gets for phased-resolved photometric studies, 
especially in emission (24). Spectrally resolved 
phase photometry has been shown to be sen- 
sitive to the presence of an atmosphere and 
molecules such as COz (25). 
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Changes in contact patterns shape the dynamics of 
the COVID-19 outbreak in China 


Juanjuan Zhang’, Maria Litvinova“, Yuxia Liang’, Yan Wang’, Wei Wang’, Shanlu Zhao*, Qianhui Wu’, 
Stefano Merler*, Cécile Viboud®, Alessandro Vespignani®, Marco Ajelli**{, Hongjie Yu'* + 


Intense nonpharmaceutical interventions were put in place in China to stop transmission of the novel 
coronavirus disease 2019 (COVID-19). As transmission intensifies in other countries, the interplay 
between age, contact patterns, social distancing, susceptibility to infection, and COVID-19 dynamics 
remains unclear. To answer these questions, we analyze contact survey data for Wuhan and Shanghai 
before and during the outbreak and contact-tracing information from Hunan province. Daily contacts 
were reduced seven- to eightfold during the COVID-19 social distancing period, with most interactions 
restricted to the household. We find that children O to 14 years of age are less susceptible to severe 
acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection than adults 15 to 64 years of age 
(odds ratio 0.34, 95% confidence interval 0.24 to 0.49), whereas individuals more than 65 years of age 
are more susceptible to infection (odds ratio 1.47, 95% confidence interval 1.12 to 1.92). Based on these 


data, we built a transmission model to study the impact of social distancing and school closure on 
transmission. We find that social distancing alone, as implemented in China during the outbreak, is 
sufficient to control COVID-19. Although proactive school closures cannot interrupt transmission on their 
own, they can reduce peak incidence by 40 to 60% and delay the epidemic. 


he novel coronavirus disease 2019 (COVID-19) 

epidemic caused by severe acute respi- 

ratory syndrome coronavirus 2 (SARS- 

CoV-2) began in Wuhan City, China, in 

December 2019 and quickly spread glob- 
ally, with 2,063,161 cases reported in 185 coun- 
tries or regions as of 16 April 2020 (J). A total of 
82,692 cases of COVID-19, including 4632 deaths, 
have been reported in mainland China, includ- 
ing 50,333 cases in Wuhan City and 628 cases 
in Shanghai City (2). The epidemic in Wuhan 
and in the rest of China subsided after imple- 
mentation of strict containment measures and 
movement restrictions, with recent cases orig- 
inating from travel (3). However, key questions 
remain about the age profile of susceptibility 
to infection, how social distancing alters age- 
specific contact patterns, and how these factors 
interact to affect transmission. These ques- 
tions are relevant to the choice of control pol- 
icies for governments and policy-makers 
around the world. In this study, we evaluate 
changes in mixing patterns linked to social 
distancing by collecting contact data in the 
midst of the epidemic in Wuhan and Shanghai. 
We also estimate age differences in susceptibil- 
ity to infection based on contact-tracing data 
gathered by the Hunan Provincial Center for 
Disease Control and Prevention (CDC), China. 
Based on these empirical data, we developed a 
mathematical disease transmission model to 
disentangle how transmission is affected by 
age differences in the biology of COVID-19 


infection and altered mixing patterns owing 
to social distancing. Additionally, we project 
the impact of social distancing and school 
closure on COVID-19 transmission. 

To estimate changes in age-mixing patterns 
associated with COVID-19 interventions, we per- 
formed contact surveys in two cities: Wuhan, 
the epicenter of the outbreak, and Shanghai, one 
of the largest and most densely populated cities 
in southeast China. Shanghai experienced ex- 
tensive importation of COVID-19 cases from 
Wuhan as well as local transmission (4). The 
surveys were conducted from 1 February 2020 
to 10 February 2020, as transmission of COVID- 
19 peaked across China and stringent interven- 
tions were put in place. Participants in Wuhan 
were asked to complete a questionnaire de- 
scribing their contact behavior (5, 6) on two 
different days: (i) a regular weekday between 
24 December 2019 and 30 December 2019, 
before the COVID-19 outbreak was officially 
recognized by the Wuhan Municipal Health 
Commission (used as baseline); and (ii) the day 
before the interview (outbreak period). Partic- 
ipants in Shanghai were asked to complete the 
same questionnaire used for Wuhan but only 
report contacts for the outbreak period. For 
the baseline period in Shanghai, we relied on 
a survey conducted in 2017-2018 that followed 
the same design (7). In these surveys, a contact 
was defined as either a two-way conversation 
involving three or more words in the physical 
presence of another person or a direct physical 


contact (e.g., a handshake). Details are given 
in the supplementary materials (SM, sections 
land 2). 

We analyzed a total of 1245 contacts reported 
by 636 study participants in Wuhan and 1296 
contacts reported by 557 participants in Shanghai. 
In Wuhan, the average daily number of contacts 
per participant was significantly reduced, 
from 14.6 for the baseline period (mean contacts 
weighted by age structure: 14.0) to 2.0 for the 
outbreak period (mean contacts weighted by 
age structure: 1.9) (p < 0.001). The reduction in 
contacts was significant for all stratifications 
by sex, age group, type of profession, and house- 
hold size (Table 1). A larger reduction was ob- 
served in Shanghai, where the average daily 
number of contacts decreased from 18.8 (mean 
contacts weighted by age structure: 19.8) to 2.3 
(mean contacts weighted by age structure: 2.1). 
Although an average individual in Shanghai 
reported more contacts than one in Wuhan on 
a regular weekday, this difference essentially 
disappeared during the COVID-19 outbreak 
period. A similar decrease in the number of 
contacts was found in the United Kingdom 
during the COVID-19 lockdown period (8). 

The typical features of age-mixing patterns 
(6, 7) emerge in Wuhan and Shanghai when we 
consider the baseline period (Fig. 1, A and D). 
These features can be illustrated in the form 
of age-stratified contact matrices (provided as 
ready-to-use tables in the SM, section 3.6), where 
each cell represents the average number of con- 
tacts that an individual has with other individ- 
uals, stratified by age groups. The bottom left 
corner of the matrix, corresponding to contacts 
between school-age children, is where the largest 
number of contacts is recorded. The contribu- 
tion of contacts in the workplace is visible in 
the central part of the matrix, and the three 
diagonals (from bottom left to top right) rep- 
resent contacts between household members. 
By contrast, for the outbreak period when 
strict social distancing policies were in place, 
many of the above-mentioned features dis- 
appear, essentially leaving the sole contri- 
bution of household mixing (Fig. 1, B and E). 
In particular, assortative contacts between 
school-age individuals are fully removed, as 
illustrated by differencing baseline and out- 
break matrices (Fig. 1, C and F). Overall, con- 
tacts during the outbreak mostly occurred at 
home with household members (94.1% in 
Wuhan and 78.5% in Shanghai). Thus, the 
outbreak contact matrix nearly coincides with 
the within-household contact matrix in both 
study sites, and the pattern of assortativity by 
age observed for regular days almost entirely 
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Fig. 1. Contact matrices by age. (A) Baseline period contact matrix for 
Wuhan (regular weekday only). Each cell of the matrix represents the 

mean number of contacts that an individual in a given age group has with 
other individuals, stratified by age groups. The color intensity represents the 
number of contacts. To construct the matrix, we performed bootstrap 
sampling with replacement of survey participants weighted by the age 


disappears (SM, section 3.6). These findings 
are consistent with trends in within-city mo- 
bility data, which indicate an 86.9% drop in 
Wuhan and 74.5% drop in Shanghai between 
early January and early February (see SM, 
section 4). Such a large decrease in internal 
mobility is consistent with most of the con- 
tacts occurring in the household during the 
outbreak period. Of note, the strict social dis- 
tancing measures implemented in Wuhan and 
Shanghai did not entirely zero out contacts in 
the workplace, because essential workers con- 
tinued to perform their activities (as observed 
in our data; see SM, section 3.5). 

The estimated mixing patterns are based on 
self-reported contacts that can thus be affected 
by various biases. In particular, reported con- 
tacts for the baseline period in Wuhan may be 
prone to recall bias because contacts were as- 
sessed retrospectively. Further, because of the 
retrospective nature of the baseline survey 
in Wuhan, we were unable to account for the 
lower number of contacts during weekends. 
The more complete data from Shanghai did 
not suffer recall bias and allowed us to weight 
contacts for weekdays and weekends; sensi- 
tivity analyses suggest that this has little impact 
on results (SM, section 8.3). Another possible 
bias is that survey participants may have felt 
pressure to minimize reported contacts that 
occurred during the outbreak, given that social 
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distancing was in place and strictly enforced 
by the government, even if the anonymity and 
confidentiality of the survey were emphasized. 
However, results are robust to inflating reported 
contacts outside of the home severalfold, sug- 
gesting that these compliance and social accept- 
ability biases linked to the outbreak period do 
not affect our main findings (SM, section 8.2). 
Another caveat is that in parallel to population- 
level social distancing measures, case-based in- 
terventions were implemented and could have 
affected contacts, including rapid isolation of 
confirmed and suspected cases and quarantine 
of close contacts for 14 days. However, only a 
small portion of the population in the two study 
sites was affected by contact tracing and quar- 
antine, thus having little to no effect on average 
contact patterns in the general population. 
Next, to understand the interplay between 
social distancing interventions, changes in hu- 
man mixing patterns, and outbreak dynamics, 
we need to consider potential age differences 
in susceptibility to infection. This is currently 
a topic of debate, because little information on 
the age profile of asymptomatic cases is avail- 
able (9, 10). To this aim, we analyzed COVID-19 
contact-tracing information gleaned from de- 
tailed epidemiological field investigations con- 
ducted by the Hunan CDC (SM, section 5). 
Briefly, all close contacts of COVID-19 cases 
reported in Hunan province were placed under 
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distribution of the actual population of Wuhan. Every cell of the matrix 
represents an average over 100 bootstrapped realizations. (B) Same as (A), 
but for the outbreak contact matrix for Wuhan. (C) Difference between 
the baseline period contact matrix and the outbreak contact matrix in 
Wuhan. (D) Same as (A), but for Shanghai. (E and F) Same as (B) and (C), 


medical observation for 14 days and were tested 
using real-time reverse transcription polymer- 
ase chain reaction (RT-PCR). Those who tested 
positive were considered as SARS-CoV-2 infec- 
tions. We estimated the odds ratios (ORs) for a 
contact of a certain age group to be infected, 
relative to a reference age group. We performed 
generalized linear mixed model regression to 
account for clustering and potential correlation 
structure of contacts exposed to the same index 
case (e.g., in the household). We included the 
age group and gender of a contact, type of 
contact, and whether the contact traveled to 
Hubei or Wuhan as regression covariates (SM, 
section 5). We found that susceptibility to 
SARS-CoV-2 infection increased with age. 
Young individuals (aged 0 to 14 years) had 
a lower risk of infection than individuals 
aged 15 to 64 years {OR = 0.34 [95% confidence 
interval (CI): 0.24 to 0.49], p < 0.0001}. By 
contrast, older individuals aged 65 years and 
older had a higher risk of infection than adults 
aged 15 to 64 years [OR = 1.47 (95% CI: 1.12 to 
1.92), p = 0.005]. These findings are in con- 
trast with a previous study in Shenzhen, where 
susceptibility to infection did not change with 
age (9). 

Next, we explore how our data can inform 
control strategies for COVID-19. A key param- 
eter regulating the dynamics of an epidemic 
is the basic reproduction number (Ro), which 
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Fig. 2. Effect of contact patterns on the A 
epidemic spread. (A) Estimated Ro during the 
outbreak (mean and 95% Cl), as a function of 
baseline Ro (i.e., that derived by using the 
contact matrix estimated for the baseline 
period). The figure refers to Wuhan and 
includes both the scenario accounting for the 
estimated susceptibility to infection by age 
and the scenario where we assume that all 
individuals are equally susceptible to infection. 
The distribution of the transmission rate is 
estimated through the next-generation matrix 
approach by using 100 bootstrapped contact 
matrices for the baseline period to obtain the 
desired Ro values. We then use the estimated 
distribution of the transmission rate and the 
bootstrapped outbreak contact matrices to 
estimate Ro for the outbreak period. The 

95% Cls account for the uncertainty on the 
distribution of the transmission rate, mixing 
patterns, and susceptibility to infection 

by age. (B) Same as (A), but for Shanghai. 
(C) Infection attack rate 1 year after the initial 
case of COVID-19 (mean and 95% Cl) as a 
function of the baseline Ro, The estimates are 
made by simulating the SIR transmission model 
(see SM) using the contact matrix for the 
baseline period and considering the estimated 
susceptibility to infection by age and assuming 
that all individuals are equally susceptible to 
infection. The 95% Cls account for the uncertainty 
on the mixing patterns and susceptibility to 
infection by age. (D) Same as (C), but for Shanghai. 


Outbreak Ry 


i?) 


corresponds to the average number of sec- 
ondary cases generated by an index case ina 
fully susceptible population. We estimated the 
impact of interventions on Ro, relying on our 
age-specific estimates of susceptibility to in- 
fection and contact patterns before and during 
interventions. We used the next-generation 
matrix approach to quantify changes in Ro 
(11) (SM, section 6). Additionally, to illustrate 
the impact of age-mixing patterns on the dy- 
namics of the epidemic, we developed a simple 
SIR model of SARS-CoV-2 transmission (SM, 
section 6). In the model, the population is 
divided into three epidemiological categories: 
susceptible, infectious, and removed (either 
recovered or deceased individuals), stratified 
by 14 age groups. Susceptible individuals can 
become infectious after contact with an in- 
fectious individual according to the estimated 
age-specific susceptibility to infection. The 
rate at which contacts occur is determined by 
the estimated mixing patterns of each age 
group. The mean time interval between two 
consecutive generations of cases was taken to 
be 5.1 days, assuming it aligns with the mean of 
the serial interval reported by Zhang et al. (3). 
In the early phases of COVID-19 spread in 
Wuhan, before interventions were put in place, 
Ro values were estimated to range between 2.0 
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and 3.5 (12-18). In this analysis, we extended 
this range from 1 to 4 for the baseline period 
(i.e., before interventions). We find that the 
considerable changes of mixing patterns ob- 
served in Wuhan and Shanghai during the 
social distancing period led to a drastic de- 
crease in Ro (Fig. 2). When we consider con- 
tact matrices representing the outbreak period, 
keeping the same baseline disease transmis- 
sibility as in the preintervention period, the 
reproductive number drops well below the 
epidemic threshold in Wuhan (Fig. 2A) and 
Shanghai (Fig. 2B). This finding is robust to 
relaxing assumptions about age differences in 
susceptibility to infection; the epidemic is still 
well controlled if SARS-CoV-2 infection is as- 
sumed to be equally likely in all age groups 
(Fig. 2, A and B). We also performed sensitivity 
analyses regarding possible recall and compli- 
ance biases of self-reported contacts as well as 
the definition of contact (i.e., considering only 
contacts lasting more than 5 min). The results 
are consistent with those reported here (SM, 
section 8). 

In an uncontrolled epidemic (without inter- 
vention measures, travel restrictions, or sponta- 
neous behavioral responses of the population) 
and for Ro in the range of 2 to 3, we estimate the 
mean infection attack rate to be in the range 53 
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to 92% after a year of SARS-CoV-2 circula- 
tion, with slight variation between Wuhan 
(Fig. 2C) and Shanghai (Fig. 2D). These esti- 
mates should be considered as an upper bound 
of the infection attack rate because they are 
based on a compartmental model that does 
not account for high clustering of contacts 
(e.g., repeated contacts among household mem- 
bers). If we consider a scenario in which so- 
cial distancing measures are implemented 
early on, as the new virus emerges, the esti- 
mated Ry remains under the epidemic thresh- 
old and thus the epidemic cannot take off in 
either location. Furthermore, we estimate that 
the magnitude of interventions implemented 
in Wuhan and Shanghai would have been 
enough to block transmission for an Rp before 
the interventions of up to ~6 in Wuhan and 
~7.8 in Shanghai. 

Next, we use the model to estimate the im- 
pact of preemptive mass school closure. We 
considered two different contact pattern sce- 
narios, based on data from Shanghai: con- 
tacts estimated during vacation periods (7) and 
contacts estimated during regular weekdays, 
after all contacts occurring in school settings 
have been removed (7). Both scenarios repre- 
sent a simplification of a school closure strat- 
egy. Indeed, school closures in response to the 
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Fig. 3. Effect of limiting 
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patterns are considered: 
(i) as estimated for the 
baseline period, (ii) as 
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estimated during school vacations (7), and (iii) as estimated for the baseline period, but suppressing all contacts at school. The inset shows the infection attack 
rate 1 year after the introduction of the first COVID-19 case (mean and 95% Cl). (C) Same as (A), but assuming equal susceptibility to infection by age. 


(D) Same as (B), but assuming equal susceptibility to infection by age. 


COVID-19 pandemic in China have entailed 
interruption of all educational on-site services. 
However, mixing patterns measured during 
school vacations indicate that a fraction of 
children still attend additional educational 
activities, as is typical in Chinese cities. On the 
other hand, when removing all contacts in 
the school setting, we do not consider poten- 
tial trickle-down effects on the mixing patterns 
of other age groups; for instance, parents may 
need to leave work to take care of school-age 
children. Our modeling approach indicates 
that limiting contact patterns to those observed 
during vacations would interrupt transmis- 
sion for baseline Ro up to 1.5 (Fig. 3, A and C). 
Removing all school contacts would do the 
same for baseline Ro up to 1.2. If we apply these 
interventions to a COVID-19 scenario, assum- 
ing a baseline Ro of 2 to 3.5, we can achieve a 
noticeable decrease in infection attack rate 
and peak incidence and a delay in the epi- 
demic, but transmission is not interrupted 
(Fig. 3, B and D). For instance, for a baseline 
Ro of 2.5 and assuming a vacation mixing 
pattern, the mean peak daily incidence is 


reduced by about 64%. In the corresponding 
scenario where school contacts are removed, 
we estimate a reduction of about 42%. Over- 
all, school-based closure policies are not suffi- 
cient to entirely prevent a COVID-19 outbreak, 
but they can affect disease dynamics and 
hence hospital surge capacity. It is important 
to stress that individuals aged 5 to 19 years in 
Shanghai represent 9.5% of the population 
(19), markedly lower than the mean in China 
[16.8% (19)] and other countries [including 
Western countries; e.g., 19.7% in the United 
States (20)]. 

The results of this study should be considered 
in light of the following limitations. In our sim- 
ulation model, we estimated the effect of social 
distancing alone; combining social distancing 
with other interventions would have a syner- 
gistic effect to even further reduce transmission. 
It is likely that population-wide social distanc- 
ing, case-based strategies, and decontamina- 
tion efforts all contributed to achieve control 
in Wuhan and Shanghai, and their effect is 
difficult to separate out in retrospective ob- 
servational studies. Our estimates of age differ- 
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ences in susceptibility to infection are based on 
active testing of 7375 contacts of 136 confirmed 
index cases. These data suffer from the usual 
difficulties inherent to the reconstruction of 
epidemiological links and detection of index 
cases. Contact data are useful, but seroepide- 
miology studies will be essential to fully re- 
solve population susceptibility profiles to 
SARS-CoV-2 infection and disease. Although 
the age patterns of contacts were similar in 
the two study locations during the COVID-19 
outbreak period, these patterns may not be 
fully representative of other locations in China 
and abroad, where social distancing measures 
may differ. Because reliable estimates of the 
contribution of asymptomatic SARS-CoV-2 in- 
fections to transmission are still lacking, we 
did not explicitly model differences between 
symptomatic and asymptomatic individuals. 
We considered a serial interval of 5.1 days (3), 
based on a prior estimate from China, at a 
time when case-based and contact-tracing 
intervention measures were in place, which 
tends to shorten the interval between suc- 
cessive cases. However, this choice does not 
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Table 1. Number of contacts by demographic characteristics and location. N is the number of participants who provided non-missing contact data. 
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tCan differ from total sample size (N = 636) because it also includes participants who had not recorded contacts during the baseline period or during the COVID-19 outbreak. Note that reduced 
denominators indicate missing data. Percentages may not total 100 because of rounding. +The 95% Cls on the mean are calculated by bootstrap sampling. §Difference is calculated by 
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affect the estimated changes in reproduction 
number between the baseline and outbreak 
periods. Modeling results may underestimate 
the effect of social distancing interventions 
because our results concentrate on the num- 
ber of contacts and ignore the type of social 
interactions (e.g., increased distance between 
individuals while in contact or use of a face 
mask), which may have changed owing to 
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increased awareness of the population (27, 22). 
Finally, it is worth noting that our school clo- 
sure simulations are not meant to formulate 
a full intervention strategy, which would re- 
quire identification of epidemic triggers to 
initiate closures and evaluation of different 
durations of intervention (6). Nonetheless, 
our modeling exercise provides an indica- 
tion of the possible impact of a nationwide 
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preemptive strategy on the infection attack 
rate and peak incidence. To generalize these 
findings to other contexts, location-specific 
age-mixing patterns and population struc- 
tures should be considered. Perhaps most 
importantly, strict lockdown strategies of 
the kind implemented in Wuhan, Shanghai, 
and other regions of the world are extremely 
disruptive economically and mentally, and 
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more targeted approaches to block transmis- 
sion are preferable in the long run. We do not 
necessarily endorse blunt lockdown poli- 
cies here; we merely describe their impact on 
COVID-19 transmission based on the Chinese 
experience. 

Our study provides evidence that the inter- 
ventions put in place in Wuhan and Shanghai, 
and the resulting changes in human behavior, 
drastically decreased daily contacts, essentially 
reducing them to household interactions. This 
led to a dramatic reduction of SARS-CoV-2 
transmission. As lockdown measures are put 
in place in other locations, human mixing pat- 
terns in the outbreak period could be cap- 
tured by data on within-household contacts, 
which are available for several countries around 
the world (5-7, 23-25). Moving forward, it will 
be particularly important to design targeted 
strategies for long-term control of COVID-19, 
including school- and work-based control strat- 
egies, along with large-scale testing and contact 
tracing (26-28). Research should concentrate 
on refining age-specific estimates of suscep- 
tibility to infection, disease, and infectiousness, 
which are instrumental to evaluating the im- 
pact of these strategies. 
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Metalens-array-based high- 
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and multiphoton quantum source 
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The development of two-dimensional metasurfaces has shown great potential in quantum-optical 
technologies because of the excellent flexibility in light-field manipulation. By integrating a metalens 
array with a nonlinear crystal, we demonstrate a 100-path spontaneous parametric down-conversion 
photon-pair source in a 10 x 10 array, which shows promise for high-dimensional entanglement 

and multiphoton-state generation. We demonstrate two-, three- and four-dimensional two-photon 
path entanglement with different phases encoded by metalenses with fidelities of 98.4, 96.6, and 
95.0%, respectively. Furthermore, four-photon and six-photon generation is observed with high 
indistinguishability of photons generated from different metalenses. Our metalens-array-based 
quantum photon source is compact, stable, and controllable, indicating a new platform for 


integrated quantum devices. 


uantum optical systems, benefiting from 

the fast speed, long coherence time, ver- 

satile controllability, and large infor- 

mation capacity of photons, is one of 

the most attractive physical systems for 
the investigation of quantum information 
processing. They are widely used in quantum 
communication (J-4), quantum computation 
and simulation (5-8), and quantum metrol- 
ogy and sensing (9, 10). With the development 
of quantum technologies, the requirement 
for both entanglement dimensionality and 
photon number increases, demanding large- 
scale, controllable, and stable quantum pho- 
tonic sources (11, 12). For example, quantum 
communication and imaging require pho- 
tons with high-dimensional entanglement, 
which can be achieved by using different 
degrees of freedom of photons, including or- 
bital angular momentum (OAM), time-bin, 
energy-time, frequency mode, and optical paths. 
However, none of these meets the require- 


‘National Laboratory of Solid State Microstructures, School 
of Physics, College of Engineering and Applied Sciences, 
Nanjing University, Nanjing, 210093, China. Department of 
Electronic and Information Engineering, The Hong Kong 
Polytechnic University, Hong Kong. *Research Center for 
Applied Sciences, Academia Sinica, Taipei 11529, Taiwan. 
4State Key Laboratory of Precision Spectroscopy, School of 
Physics and Electronic Science, East China Normal 
University, Shanghai, 200062, China. °Key Laboratory of 
Intelligent Optical Sensing and Manipulation, Ministry of 
Education, Nanjing University, Nanjing, 210093, China. 
®Collaborative Innovation Center of Advanced 
Microstructures, Nanjing, 210093, China. Key Laboratory of 
Quantum Information, CAS, University of Science and 
Technology of China, Hefei, 230026, China. °Synergetic 
Innovation Center of Quantum Information & Quantum 
Physics, University of Science and Technology of China, 
Hefei, 230026, China. °Department of Electrical Engineering, 
National United University, Miaoli 36003, Taiwan. 
1Department of Physics, National Taiwan University, Taipei 
10617, Taiwan. 

*These authors contributed equally to this work. 
}Corresponding author. Email: wangshuming@nju.edu.cn (S.W.); 
lijian.zhang@nju.edu.cn (L.Z.); zlwang@nju.edu.cn (Z.W.); 
zhusn@nju.edu.cn (S.Z.); dinping.tsai@polyu.edu.hk (D.P.T.) 


Li et al., Science 368, 1487-1490 (2020) 26 June 2020 


ment of high fidelity with large dimensional- 
ity for practical applications. Photonic quantum 
computation and metrology rely on multi- 
photon states, which can be synthesized with 
multiple single-photon sources from spon- 
taneous parametric processes in nonlinear 
materials or by time-multiplexing the spon- 
taneous emission of quantum dots (73), but 
the largest photon number is limited to ~20 
(4). Although quantum dots show excellent 
performance in generating single photons, the 
spontaneous parametric process continues 
to be the main method for the generation of 
high-dimensional and/or multiphoton entan- 
gled states (13). Recent progress on integrated 
photonic systems based on such a process pro- 
vides an ideal platform for large-scale quantum- 
optical sources (15). 

Metasurfaces consist of a dense arrange- 
ment of dielectric or metallic subwavelength 
antennas in an ultrathin interface (16). By con- 
trolling the phase distribution, metasurfaces 
have been widely used to manipulate the 
wavefront of a light field (77-23). Metasurfaces 
have also found applications in the nonclassi- 
cal region (24-27). However, quantum photonic 
sources based on metasurfaces have not yet 
been demonstrated. 

Here, by using a metalens array, we dem- 
onstrate a 100-path spontaneous parametric 
down-conversion (SPDC) photon source. The 
100-path SPDC photon source is realized by 
combining a metalens array with a 0.5-mm 
type II B-barium borate (BBO) crystal (Fig. 1, A 
to D). The metalens array consists of 100 de- 
signed metalenses (arranged in a 10 x 10 array), 
composed of GaN nanopillars fabricated by 
electron beam lithography (EBL), dry etching, 
and resist removing (73). Each metalens is 
designed with a uniform focal length of f= 
1.1 mm at a working wavelength of 404 nm 
and an area of 100 um by 100 um. The period 


of the unit element is 200 nm and the height 
of the nanopillars is 800 nm [design details of 
the metalens are in the supplementary mate- 
rials (73)]. The image of the focal spot array of 
the pump laser, with a wavelength of 404 nm 
(Fig. 1E), shows that the measured focusing 
efficiency of each metalens has a uniform value 
of 56.0 + 6.6%, owing to the precise fabrica- 
tion (fig. S1). 

When a pump laser is incident on the 
metalens array, a 10 x 10 array of focal spots 
is formed inside the BBO crystal. Each of the 
spots can trigger a SPDC process and generate 
a pair of photons in a probabilistic manner. 
The phase-matching condition of the BBO 
crystal is designed to ensure that each photon 
pair has two well-defined beam-like spatial 
modes (28). The uniform intensity and spatial 
distributions of these focal spots enable the 
further realization of high-dimensional en- 
tangled and multiphoton quantum photonic 
sources. As a preliminary test, a vertically po- 
larized diode laser (404 nm, 100 mW) is inci- 
dent on the metalens-BBO system. An array 
of 10 x 10 SPDC photon pairs with nearly 
equal intensities is observed with an electron- 
multiplying charge-coupled device (EMCCD) 
(Fig. 1F). 

First, the path-encoded quantum entangle- 
ment is demonstrated. Each metalens in the 
system generates a pair of photons in a pair 
of conjugate spatial modes (figs. $2 and S3). 
These modes are defined as Sg tO S99 for the 
signal photon (vertically polarized) and as ig 
to igg for the idler photon (horizontally po- 
larized). In the case that only one pair of pho- 
tons is generated from the source, without 
knowing which metalens the photons are gen- 
erated from, the two-photon state can be writ- 
ten as 3 (|0, 0) +|1, 1) +|2,2) +... +|99, 99)), 
where the number represents the path defined 
previously. This state is a 100-dimensional 
path-encoded quantum entangled state. As 
a demonstration, we verify the quantum en- 
tanglement in two, three, and four dimensions 
by reconstructing the reduced quantum states 
with quantum state tomography (QST) mea- 
surements and discuss the high-dimensional 
entanglement measurement in the supplemen- 
tary materials (13). 

Two-dimensional (2D) entanglement states 
are analyzed using photon pairs generated 
by two metalenses, including the adjacent and 
nonadjacent ones. The QST measurements 
are performed (fig. S4) and the correspond- 
ing density matrix is accurately reconstructed 
by the maximum likelihood estimation (MLE) 
method. Figure 2A shows the experimental 
result of a maximally 2D path-entangled state 
of (|00) + |11))/./2. The fidelity between the 
reconstructed state and the ideal maximally 
entangled state is as high as 0.985, proving that 
the produced state is very close to the ideal 
state. In addition to focusing the pump light, 
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Fig. 1. Schematic and characterization of the quantum metalens array. (A) Schematic of the quantum source based on a metalens array. (B) Microscopy 
image of the metalens array. (€ and D) Scanning electron microscopy image of the GaN metalens in (C) top view and (D) side view. (E) Microscopy image of the focal 
plane of the pump light. (F) Image of the SPDC photon pair array recorded by EMCCD. Inset: Magnified image of the area indicated by the blue dashed box. 

The red dashed box shows one pair of photons generated by one metalens. Scale bars: (B) 200 um, (C) 2 um, (D) 1 um. 


the metalens is able to encode its phase, which 
in turn controls the relative phase between dif- 
ferent terms of the generated quantum state. 
We design the metalenses with different phase 
differences (Ag = 1/2, 7, 32/2) between two 
adjacent metalenses and analyze the 2D entan- 
glement states generated with them (figs. S5 
and S6). Figure 2, B to D, are the reconstructed 
density matrixes of the prepared states, with 
the phase differences of (Ag = n/2,1, 31/2), 
respectively. All these density matrixes show 
good agreement with the ideal corresponding 
states of (|00) + ¢|11))/./2, (|00) — |11))/V2 
and (|00) — 4|I1))/1/2, respectively. Without loss 
of generality, we measure six 2D entangled 
states, with respect to adjacent metalenses 
pairs at random locations, for each phase dif- 
ference and estimate them by the QST mea- 
surements. All the results show high fidelities, 
with an average value of 0.979, to the maxi- 
mally entangled states and a fidelity of 0.984 
to the closest pure entangled states (fig. S6). 
Other results with nonadjacent metalens pairs 
are shown in fig. S7. The fidelities to the maxi- 
mally entangled states are all above 0.96 (13). 
These high fidelities confirm that we can 
reliably control the generated states by meta- 
nanostructure. 
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Fig. 2. Characterization of entangled quantum states. (A to D) Typical reconstructed density matrixes by 
QST measurements with metalens phase differences of Ag = 0, x/2, x, and 3x/2, respectively. The quantum 
state fidelities are 0.985, 0.987, 0.987, and 0.984 to the maximally entangled states, respectively. 


Quantum entangled states with higher di- 
mensions are further characterized. Figure 3, 
Band D, are typical 3D and 4D density matrixes 
reconstructed by QST measurements and cor- 
respond to the configuration of the photon pair 


sources in the dashed boxes in Fig. 3, A and C, 
respectively. In these high-dimensional cases, 
the beam displacer (BD)-based measurement 
setup becomes considerably more complex, 
as shown in the inset of fig. S4. There are not 


2 of 4 


RESEARCH | REPORT 


enough degrees of freedom to balance the 
additional accumulation of experimental 
fluctuations introduced by the BD-based 
spatial mode combing and analyzing sys- 
tems. It is a challenge to reconstruct ideal 


A B 


Re(p) 


33. 00 


density matrixes of the maximally entan- 
gled states. Here, the reconstructed results 
show very high fidelities to the pure quan- 
tum states. The fidelities to the estimated 
pure states are 0.966 for the 3D entangled 


im{p) 


Fig. 3. High-dimensional quantum entanglement states characterization. (A to D) Reconstructed 
density matrixes for 3D (B) and 4D (D) quantum state by QST measurements corresponding to the photon 
pairs in the dashed blue boxes in (A) and (C), respectively. 
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Fig. 4. Multiphoton quantum source based on a metalens array. (A) Four-photon and (B) six-photon 
coincidence dependence to the pump power. The red dashed lines are the theoretical estimation of quadratic 
(four-photon) and cubic (six-photon) trend. The insets show the metalenses involved in the corresponding 
tests. The schematic diagram (C) and the fringe (D) of the HOM interference measurement. 


Li et al., Science 368, 1487-1490 (2020) 


26 June 2020 


state (Fig. 3B) and 0.950 for the 4D entan- 
gled state (Fig. 3D), and the fidelities to the 
maximally entangled states are 0.965 and 0.911, 
respectively. These results, in addition to the 
measured results in the supplementary ma- 
terials (73), show entangled fidelities higher 
than 0.94 to the closest pure states (figs. S8 
and S9), manifesting the high qualities of the 
high-dimensional entangled states from the 
metalens-array-based quantum source. The de- 
creases in the measured fidelities are mainly 
caused by the long QST measurement time for 
high-dimensional entangled states, in which 
the drift of the measurement system cannot 
be ignored. Therefore, characterization of the 
entanglement in higher dimensions may need 
to resort to the development of integrated de- 
vices (13, 29). 

Furthermore, the multiphoton source based 
on the metalens array system is characterized. 
Unlike the general SPDC-based multiphoton 
sources, in which multiple nonlinear crystals 
and long-time stable complex optical setup 
are required (14), our source only needs one 
nonlinear crystal and the setup is substantially 
more compact and stable. The experimental 
setup is discussed in the supplementary mate- 
rials (13), where a 415-nm femtosecond pulsed 
laser is introduced as the pump light to increase 
the possibility for multiphoton generation 
(fig. S10). Here, we use the same metalens array 
sample shown in Fig. 1 and the BBO crystal de- 
signed for the 415-nm pump laser (73). Because 
each metalens may generate one photon pair 
simultaneously, we can obtain multiphoton- 
pair sources within this 100-metalens array. 
As a demonstration, we characterize the per- 
formance of the four-photon and six-photon 
source from two and three adjacent metalenses, 
respectively. Figure 4, A and B, show the pump 
power dependencies of the four-photon and 
six-photon coincidence counts. The ideal power 
dependencies of the four-photon and six-photon 
coincidence counts follow a quadratic and cubic 
relationship, respectively, as shown in the red 
dashed lines in Fig. 4, A and B. The measured 
data, shown in the blue circles, agree well with 
the ideal trend, indicating a feasible multipho- 
ton source. 

We further carry out the Hong-Ou-Mandel 
(HOM) interference to test the purity and in- 
distinguishability of photons generated from 
different metalenses. Two independent photon 
pairs are generated by two adjacent metalenses 
(Fig. 4C). One photon from each pair is used 
as the trigger (heralding), and two heralded 
photons interfere at a 50:50 fiber beam splitter. 
The HOM interference is observed by record- 
ing the fourfold coincidence as a function of 
the relative delay between the heralded pho- 
tons. Figure 4D is the measured HOM inter- 
ference result, and the visibility of the HOM dip 
is 86.3%. This clearly verifies the performance 
of the multiphoton quantum source and indicates 
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that such metalens array can successfully 
provide a compact platform for the multi- 
photon source preparation. 

On the basis of the metalens array, we dem- 
onstrate a compact, stable, and controllable 
platform for a quantum optical source in both 
high-dimensional entanglement and multipho- 
tons, which extends the spatial configuration 
and entanglement dimensionality of the inte- 
grated path-encoded quantum source. Our re- 
sults indicate that metasurface structures can 
provide a route for the generation and control 
of complex quantum states, not only increas- 
ing the quantum system dimensionality but 
also allowing for the coherent control of mul- 
tiple photons, thus providing a compact and 
practical platform for the development of ad- 
vanced on-chip quantum photonic informa- 
tion processing. 
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Quieting a noisy antenna reproduces photosynthetic 


light-harvesting spectra 


Trevor B. Arp'2*, Jed Kistner-Morris'@*, Vivek Aji¢, Richard J. Cogdell®*+, 


Rienk van Grondelle*>+, Nathaniel M. Gabor?24+ 


Photosynthesis achieves near unity light-harvesting quantum efficiency yet it remains unknown whether 
there exists a fundamental organizing principle giving rise to robust light harvesting in the presence of 
dynamic light conditions and noisy physiological environments. Here, we present a noise-canceling 
network model that relates noisy physiological conditions, power conversion efficiency, and the resulting 
absorption spectra of photosynthetic organisms. Using light conditions in full solar exposure, light 
filtered by oxygenic phototrophs, and light filtered under seawater, we derived optimal absorption 
characteristics for efficient solar power conversion. We show how light-harvesting antennae can 

be tuned to maximize power conversion efficiency by minimizing excitation noise, thus providing a 
unified theoretical basis for the observed wavelength dependence of absorption in green plants, 


purple bacteria, and green sulfur bacteria. 


n photosynthesis, light energy harvesting 
begins with the absorption of sunlight. 
Photoexcitation energy is rapidly trans- 
ferred through an antenna network before 
reaching the reaction center, where charge 
transfer converts excitation energy into an 
electrochemical potential gradient across the 
photosynthetic membrane (7). Even in the 
presence of dynamic light conditions, rapidly 
fluctuating molecular structure, and highly in- 
tricate energy transfer pathways (1-5), the 
light-to-electron conversion process exhibits 
near unity quantum efficiency. The delicate 
interplay of quantum effects with molecular 
mechanisms of energy management, e.g., non- 
photochemical quenching (6-8), has been ex- 


Fig. 1. Light-harvesting A 
noisy antenna. (A), Sche- 

matic of a photosynthetic A 
antenna reduced into a 

network with two input 

nodes A and B with input 

rates P, and Pg and output B 
O with rate Q. Energy is 

absorbed by molecules a 

and b (at rates P, and Pg) 

and is transferred to the 

output as usable energy. Cc 
(B) Schematic two-channel 


plored across diverse phototrophs (9-12) but 
the elementary connection between highly ro- 
bust light energy harvesting and energetic 
fluctuations has not been established. 
Transforming noisy inputs into quiet out- 
puts represents a general design challenge in 
network architectures including multinational 
energy grids (13-17), auditory and visual neural 
networks (/8-2/), and nanoscale photocells 
for next-generation optoelectronics (22). Al- 
though network inputs exhibit statistical fluc- 
tuations (e.g., rapid changes of sunlight absorbed 
by a leaf or solar panel), network outputs may 
demand a steady rate of information or energy 
for optimal performance (e.g., constant power 
from the grid to maintain indoor lighting). 
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Simulated average excitation energy as a function of time within a noisy antenna composed of 10 sets of a and 
b molecules. (Right) Time-averaged histogram of the internal energy (detailed in the supplementary materials, 
section S1.4). The antenna is subject to internal (fast) and external (slow) fluctuations. Over long time scales, 
the time-averaged histogram resembles a normal distribution (black line). 
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Statistical fluctuations—arising from environ- 
mental variations and internal processes— 
fundamentally limit the throughput efficiency 
of any network. If the flow of energy (power) 
into a network is substantially larger or smaller 
than the flow out of the network required to 
optimally match the output demand, then 
the network must adapt or be structured in 
such a way as to reduce the sudden overflow 
or underflow of energy. When the network 
fails to manage fluctuations, the results may 
be extraordinary (e.g., photo-oxidative stress 
in photosynthetic light harvesting or explosive 
damage to transformers caused by fluctua- 
tions in the grid). 

We constructed a model that uses general- 
izations of networks to extract essential aspects 
of photosynthetic light harvesting (Fig. 1). A 
simple network of nodes (points at which lines 
intercept) connected by links (connecting lines) 
represent physical objects: excitation energy 
levels and intermolecular transfer events within 
the antenna system, respectively. In photosyn- 
thesis, light enters the antenna through a large 
number of pigment molecules, each of which 
is amember of a small set of distinct molecular 
species (e.g., chlorophyll a and b). Our model 
considers the advantage in having light en- 
tering the network through two classes of ab- 
sorbing excitation energy levels, nodes A and 
B, which can absorb powers P, and Pz. 

After an absorption event, excitation energy 
moves between internal nodes of the antenna 
network, representing the excitation of inter- 
mediate states within the biological antenna 
complex (2, 17, 23). As an example, excitation 
energy absorbed by a chlorophyll b molecule 
in the light-harvesting complex LHC2 is trans- 
ferred to chlorophyll a, and from there to an- 
other chlorophyll a in the same or in another 
complex. There are many such pathways through 
the antenna network that may share interme- 
diate links, but each specific pathway (Fig. 1A, 
colored lines) eventually terminates on the 
output, O, which on average delivers power Q. 
Rather than model each pathway, we modeled 
the average behavior from all inputs A and B 
through all pathways in the network. Thus, 
we define probabilities p, and pz, which are 
the total probabilities that any input A and 
Bwill absorb. Mathematically, the example 
pathway given above, along with all other 
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pathways that originate on a node B, is con- 
tained in pz. The average power (or rate of 
energy) coming from the B absorbers is there- 
fore pgPp. This is the average value; the actual 
flow of excitation energy at any given time is 
stochastic. 

By analyzing the stochastic flow of excitation 
energy, we can characterize the antenna net- 
work by statistical averages (power throughput) 
and fluctuations in the rate of energy flow, 
which we will call noise (see the supplemen- 
tary materials, section S1). The power through- 
put of the antenna system is determined by 
the external light conditions, the absorption 
characteristics of the absorbing pigment mol- 
ecules (Fig. 1B) or input nodes, and the molec- 
ular dynamics of the network. The antenna 
inputs are described in the usual way: Light 
absorption by the pigment molecules is char- 
acterized by peak width w, separation Ad = 
|Ag - A4|, and the center wavelength (or aver- 
age distance) between the peaks Ao. The solar 
spectral irradiance (Fig. 1B, gray line), which 
varies as light propagates through air, the can- 
opy, or seawater, gives the average power 
available within a given range of wavelengths. 
Choosing the wavelength of an absorption peak 
simultaneously specifies both the excitation 
energy and power entering the noisy antenna. 
Although the excitation energy is inversely 
proportional to wavelength, the absorbed power 
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P, or Pz is the integrated product of the spec- 
tral irradiance and the absorption character- 
istics of the light-harvesting antenna. 

Because the absorbed solar power rarely 
matches exactly the rate of optimal output, a 
finely tuned network is one that most effec- 
tively balances minimizing the internal noise 
with robustness against external noise. Noise 
in the antenna arises from two main sources: 
inherent mismatch between inputs and out- 
put, which may arise because of fast dynam- 
ics in the protein structure and corresponding 
electronic properties, and dynamic external 
light conditions. In photosynthesis, an over- 
powered antenna will produce excess en- 
ergy that can drive deleterious back reactions 
(24, 25). Conversely, a light-harvesting network 
in an underpowered state produces nonoptimal 
output, because the rate of energy transfer 
out of the network is fixed by electrochem- 
ical processes (26). Over long periods of time, 
the degree to which the light-harvesting net- 
work is overpowered or underpowered is mea- 
sured by the mean-squared deviation (i.e., 
noise) of the total input power (through P, 
and Pz) from the optimal output power at Q, 
(Fig. 1C) (see the supplementary materials, 
section S1). 

Tuning only the absorption characteristics, 
our goal was to find a network that spends the 
least amount of time in a state for which the 


random external fluctuations 


input power is too large or too small compared 
with the output of the network, thus maximiz- 
ing power conversion efficiency (Fig. 1C). Within 
our model, probabilities p,, and pz couple the 
inputs of the network P, and P; to the output 
Q: p4P4 + PpPsz = Q. From this expression, we 
first evaluate the variance of the average dis- 
tribution p4P, + pgPs. Minimizing this vari- 
ance yields the optimal values of P, and Pz to 
quiet the antenna. We then input the local 
spectral irradiance to a model optimization 
function, the maxima of which determine the 
optimal absorption characteristics for noise 
cancellation (see the supplementary materials, 
sections S1.1 to $1.3). 

Using this framework, we can predict the 
behavior of three noise regimes within the 
antenna network: overtuned, finely tuned, 
and poorly tuned (Fig. 2). For simplicity, these 
examples are where Q = (P, + Pg)/2, whereas 
a broader parameter range is explored in de- 
tail in the supplementary materials, section 
$1.2. Although the light conditions are iden- 
tical for all three cases (Fig. 2A, gray lines), 
we can examine how the noise changes with 
different absorption characteristics (details 
of this calculation can be found in the supple- 
mentary materials, section $1.4). When the ab- 
sorbing peaks are spaced too closely (Fig. 2A, 
top), the inherent antenna noise can be strongly 
reduced, and in the limit that P, = Q = Pp, there 
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Fig. 2. Quieting a noisy antenna by tuning the absorption characteristics. 
(A) Absorption peaks for two absorbers a and b overlaid on an ideal blackbody 
solar spectrum (T = 5500 K, gray line) for three cases: two closely spaced 
absorbers (top), two absorbers separated to optimize the noisy antenna 
(middle), and two widely separated absorbers (bottom). (B) Simulated 
excitation energy versus time for a two-channel antenna with three different 
values of A, comparable to the cases shown in (A). The left side shows 
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the excitation energy time traces without external fluctuations. The right side 
includes random external fluctuations. (©) Histograms of time spent in 
overpowered (red) and underpowered (blue) states for the three series in (B). 
In the top panel, the distribution is flat and favors no value. In the middle 
panel, the distribution is a sharply peaked normal distribution that favors 

Q. In the bottom panel, the distribution is normal but wider than that in the 
middle panel. 
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Fig. 3. Comparison of the noisy antenna model with photosynthetic 
absorption spectra in three distinct niches. (A) Absorption spectrum of 
LHC2 (blue) (27) overlaid on the terrestrial solar spectrum (light gray) (31). 
(B) Absorption spectrum of the LH2 complex (28) overlaid on the solar 
spectrum measured below a canopy of leaves (light gray) (see the 
supplementary materials, section S3, for details). (©) Absorption spectra of 
bacteriochlorophyll c (blue) and e (green) (29, 30) compared with the 


are negligible fluctuations in the rate of en- 
ergy flow (Fig. 2B, top left). This lower bound 
to the internal noise cannot be reached in nat- 
ural photosynthetic antennae, where protein 
dynamics will always drive fluctuations of in- 
termediate excitation energy transfer events. 
Rather, the overtuned antenna noise is di- 
rectly proportional to, and thus dominated by, 
changes in the varying light spectrum (Fig. 
2B, top right). In the presence of random ex- 
ternal fluctuations, the distribution of time 
spent in an overpowered or underpowered 
state is flat (Fig. 2C, top). In the overtuned 
antenna, the average input rarely matches the 
optimal output. 

A poorly tuned antenna (Fig. 2A, bottom) is 
similarly deficient. If the absorbing peaks are 
well separated, then the antenna spends most 
of the time overpowered or underpowered. 
When the power sources P, or Pz are significantly 
greater or less than the power sink (P4 >> Q >> 
Pp), the noise (as evidenced by a histogram of 
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with dashed lines. 


the excitation energy) in the poorly tuned 
antenna becomes broader as the absorbing 
peaks become more separated (Fig. 2C, bottom). 
When viewed over long times, the poorly tuned 
antenna spends too little time outputting the 
optimal power Q. 

The finely tuned antenna absorbs at specific 
positions on the spectrum that give rise to 
robust light harvesting even in the presence 
of both varying light conditions and substan- 
tial internal noise. Compared with the over- 
tuned and undertuned cases, the finely tuned 
antenna allows for intermediate internal noise 
levels (Fig. 2B, middle) yet delivers a narrow 
distribution of power centered at the optimal 
output Q (Fig. 2C, middle). Robustness in light 
harvesting is thus the ability to output, on aver- 
age, the optimal rate Q yet simultaneously allow 
for internal noise. 

To determine the optimal absorption spec- 
trum for robust light harvesting, we computed 
the spectral positions for which the peaks 


400 


BChI (green sulfur bacteria) 
i‘ | experiment 


600 800 
wavelength (nm) 


calculated solar spectrum at a 2-m depth of water (light gray) (32, 33). 
(D to F) Predicted ideal absorption peaks from optimizing A = P4 - Pg for 
the full solar spectrum, the solar spectrum attenuated through canopy, 
and the solar spectrum attenuated through seawater, respectively (see the 
supplementary materials, sections S2 and S3, for optimization and spectra 
details, respectively). Photosynthetic absorption peaks are identified 


are as close as possible on the light spectrum 
(favoring reduced internal noise) yet the dif- 
ference in the absorbed power A = P, — Pz is 
maximized (supporting robustness against 
external variations). This condition is equiv- 
alent to maximizing the derivative of the light 
spectrum with respect to wavelength, thus 
resulting in absorption peaks in regions of 
steepest slope (see the supplementary ma- 
terials, section S1.3). The absorption spectra, 
and thus the excitation transitions, are tuned 
so that the time-averaged sum of input excita- 
tion energy is sharply peaked at the output rate 
(Fig. 2C, middle). 

For three prototypical photosynthetic an- 
tennae, the light-harvesting complex of green 
plants (LHC2), the light-harvesting complex 
of purple bacteria (LH2), and the bacterio- 
chlorophyll c and e pigments of green sulfur 
bacteria (BChl c and e), the natural absorp- 
tion spectrum (Fig. 3, A to C) (27-30) can be 
compared with that predicted by our model 
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(Fig. 3, D to F) (see the supplementary ma- 
terials, section S2, for full details), which takes 
as input the local irradiance spectrum, shown 
as solid gray lines in Fig. 3, A to F. The ab- 
sorption peak positions and spectral sepa- 
ration predicted under light conditions in air 
(31), under canopy, or under seawater (32, 33) 
(colored lines in Fig. 3, D, E, and F, respectively) 
show strong agreement with the absorption 
spectra of these three phototrophs. Using only 
the external light spectrum and the linewidth 
w, the predicted peak center position A) and 
separation Ad reproduced the measured ab- 
sorption peaks with an average error of 2.1% 
(Table 1). 

The noisy antenna model also reproduced a 
general feature of photosynthetic light harvest- 
ing that was observed across all three proto- 
typical phototrophs. Photosynthetic pigments 
do not absorb at the maximum solar power. In- 
stead, all three phototrophs considered exhibit 
pairs of closely spaced peaks in regions where 
the spectrum shows a steep rate of change with 
respect to wavelength. Photosynthetic plants 
look green because their antenna complexes 
absorb light across the visible spectrum, in- 
cluding the blue and red portions, yet reflect 
green wavelengths (Fig. 3D). Purple bacteria are 
aquatic phototrophs (34). They have adapted to 
sunlight that is filtered through the canopy of 
trees and floating aerobic phototrophs (Fig. 3E, 
gray line, and see the supplementary mate- 
rials, section S3) and use a light-harvesting 
complex in which bacteriochlorophyll dom- 
inates light absorption away from the visible, 
including green (Fig. 3E). Green sulfur bacteria 
are a geographically diverse group of bacte- 
ria that are adapted to solar light shining through 
seawater to depths where it is anaerobic (35). 
They do not absorb the peak intensity of this 
attenuated light spectrum and instead ab- 
sorb in the region of steepest spectral rate of 
change. 

Underwater phototrophs provide an excel- 
lent natural experiment with which to test 
the predictive strength of our model because 
the solar spectrum is highly variable as a func- 
tion of depth (36). When considering the pen- 
etrating spectrum below the seawater surface, 
light intensity is attenuated as depth increases, 
particularly in the red and infrared zones, be- 
cause of absorption and scattering in sea- 
water (Fig. 4A). By comparing the absorption 
spectra of subsurface marine phototrophs, 
such as green sulfur bacteria, with those pre- 
dicted by quieting a noisy antenna, we can 
explore whether the natural photosynthetic 
absorption spectrum matches our model pre- 
dictions for the relevant phototroph’s pre- 
ferred depth. 

From these attenuated solar spectra, we cal- 
culated an optimization parameter A°? as a 
function of Ad and Ao. A°? is a function mod- 
ified from the calculation of A = P, — Pg such 
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Fig. 4. Predicted absorption spectra of a finely tuned noisy antenna under seawater. (A) Solar spec- 
trum in air (31) and attenuated by various depths of water (labeled) (32, 33). (B) Optimization landscape 
calculation of A°P versus center wavelength 49 and peak separation Aa for the solar spectrum under 1 m 
of seawater (w = 15 nm). Red points identify two equally favorable maxima corresponding to a set of peaks on 
either side of the spectral maximum. (C to F) Ideal absorption peaks predicted from the solar spectrum 

at each depth. (D) shows the peaks extracted from the calculation in (B) color coded blue, green, orange, and 
red to track peak locations with depth. 


—————— EEE 
Table 1. Absorption peak data versus model calculation. 


Actual value, Calculated value, Relative % error 


Peak name nm (eV) nm (eV) from nm (eV) Reference 
Chlorophyll a vee 428 (2.90) A929 (2.89) 0.23 (0.34) 0 22) 
Chlorophyll b 1 440 (2.82) 459 (2.70) 4.32 (4.26) (27) 


2.25 (1.94) 


116 (1.04) 
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that its maxima quiet a noisy antenna (see 
the supplementary materials, section S1.3). 
An example color map of the magnitude of 
A°? at a depth of 1 m and w = 15 nm reveals 
two maxima in the color plot near Ag = 400 
and 750 nm (Fig. 4B). These maxima identify 
the wavelength characteristics of a finely 
tuned antenna under seawater. By extract- 
ing the values of AQ and A, at the maximum 
in A, we obtain the characteristic absorption 
spectra of the fine tuned antenna as a function 
of seawater depth (Fig. 4, C to F). We found that 
quieting a noisy antenna under 2 m of seawater 
accurately reproduces the absorption spec- 
trum of green sulfur bacteria. Although highly 
adaptable, green sulfur bacteria are known 
to thrive at 1 to 2 m below the surface (37), co- 
inciding with the conditions for which their 
light-harvesting antenna is finely tuned for 
solar power conversion. 

The degree to which we were able to repro- 
duce photosynthetic absorption spectra is a 
surprising result, suggesting an underlying 
organizing principle for light-harvesting sys- 
tems: Fluctuations fundamentally limit the 
efficiency of networks and must be avoided. 
Phototrophs must balance environmental in- 
puts to sustain steady production and storage 
of fuel under substantially different environ- 
mental conditions. Phototrophs across many 
photosynthetic niches may have adapted to 
build fluctuation-canceling light-harvesting 
antennae onto which other active mechanisms 
for reducing fluctuations can be added (e.g., 
nonphotochemical quenching) (6-8). Although 
the connection of our model to natural antenna 
systems requires detailed quantum models, our 
framework gives new insight into how extinc- 
tion coefficients, delocalization lengths, and 
radiative rates conspire to reduce noise in nat- 
ural antennae. Moreover, our findings will in- 
spire comprehensive experiments in which 
the light environment is carefully controlled 
while the absorption spectrum of adaptable 
model organisms is monitored. By developing 
noise-canceling antennae as a technological 
foundation, natural and artificial energy- 
harvesting networks—from bacteria thriving 
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underwater to extended power grids—could 
be adapted to efficiently convert noisy inputs 
into robust outputs. 
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DOG GENOMICS 


Arctic-adapted dogs emerged at the 
Pleistocene—Holocene transition 
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Although sled dogs are one of the most specialized groups of dogs, their origin and evolution has 
received much less attention than many other dog groups. We applied a genomic approach to investigate 
their spatiotemporal emergence by sequencing the genomes of 10 modern Greenland sled dogs, an 
~9500-year-old Siberian dog associated with archaeological evidence for sled technology, and an 
~33,000-year-old Siberian wolf. We found noteworthy genetic similarity between the ancient dog and 
modern sled dogs. We detected gene flow from Pleistocene Siberian wolves, but not modern American 
wolves, to present-day sled dogs. The results indicate that the major ancestry of modern sled dogs 
traces back to Siberia, where sled dog-specific haplotypes of genes that potentially relate to Arctic 


adaptation were established by 9500 years ago. 


espite decades of studies, consensus has 

yet to be reached on when and where 

dogs were first domesticated and when 

they were first deliberately used in many 

of the roles they exhibit today. In Siberia, 
late Upper Paleolithic artifacts of carved bone, 
antler, and ivory similar to tools used by modern 
Inuit for securing dog harness straps suggest 
ancient origins of dog sledding (1). Furthermore, 
archeological findings from Zhokhov Island pro- 
vide evidence of sled technology and dogs by 
the Sumnagin Mesolithic culture ~9000 to 
8000 years ago (1-3) (fig. S1), offering an op- 
portunity to use genomics to further our un- 
derstanding of early dog domestication and 
the origin of sled dogs. 

We generated nuclear genomes from a 
dog mandible present at this site (“Zhokhov,” 
9.6x coverage), dated to 9524 calendar years 
before present (YBP) (Fig. 1A and fig. S2), 
and a Siberian Pleistocene wolf mandible 


(“Yana,” 4.7x coverage), dated to 33,019.5 YBP 
(Fig. 1A and fig. $3). In addition, we sequenced 
10 modern Greenland sled dog genomes, a 
dog best described as an indigenous land- 
race breed used for hunting and sledging by 
Inuit. Samples consisted of two individuals 
from each of five geographically diverse lo- 
calities (Fig. 1A), thus providing a broad rep- 
resentation of the indigenous dog diversity. 

We analyzed our data alongside genomes 
from 114 geographically and genetically diverse 
canids (table S1) using whole-genome pairwise 
distances, principal component analysis, TreeMix 
(4) admixture graphs, and D statistics (Fig. 1). 
Yana appeared alongside wolves (Fig. 1, B and C), 
whereas Zhokhov was found to be most closely 
related to dogs. Specifically, Zhokhov was most 
similar to modern sled dogs (Greenland sled 
dogs, Alaskan malamutes, and Alaskan and 
Siberian huskies) and American pre-European- 
contact dogs (PCDs), best illustrated by the ~2x 


Port au Choix dog from Maritime Archaic cul- 
tural context ~4000 YBP (3). Unsupervised clus- 
tering analyses with NGSadmix software (4) 
(fig. S6) grouped modern domestic dogs into 
four clusters: African, European, Asian, and 
sled dogs including Zhokhov. These relation- 
ships were confirmed by an admixture graph 
in which Yana was more closely related to a 
Pleistocene wolf from Taimyr Peninsula than 
to modern wolves, whereas Zhokhov represents 
a lineage that diverged from the ancestor of 
present-day sled dogs (Fig. 1C and figs. S8 and 
S9). This suggests genetic continuity in Arctic 
dog breeds for at least the past ~9500 years, 
setting a lower bound on the origin of the sled 
dog lineage. 

Next, D statistics indicated an excess of allele 
sharing between Yana-Taimyr wolves and PCDs- 
Zhokhov-sled dogs (Fig. 1D and fig. S14), corrob- 
orating previous reports (3, 5). This suggests 
that the admixture occurred between Pleistocene 
wolves and the ancestors of PCDs, Zhokhov, and 
sled dogs. 

Previous studies have demonstrated an asso- 
ciation between canine transmissible venereal 
tumors (CIVTs) and sled dogs, especially PCDs 
(3). Here, we evaluated the relationship among 
Zhokhov, two CTVT genomes (table S1), and 
dogs and wolves using f3 statistics and phylo- 
genetic analysis. Recent analyses of exome data 
suggested that CTVT expanded across Eurasia 
~6000 years ago (6), thus reducing the likeli- 
hood that this transmissible cancer originated 
in the Americas. In our study, both the phylo- 
genetic analysis (fig. S9) and f; statistics (fig. S10) 
placed the CIVT genomes closer to PCDs than 
to sled dogs or Zhokhov. These results suggest 
that the basal dog lineage that led to PCDs (3) 
occurred in Eurasia ~6000 years ago and/or 
there were multiple introductions of PCD-like 
dogs to the Americas. 

We used NGSadmix, admixture analyses, and 
D statistics (figs. S6 to S8 and S11 to S15) to 
evaluate gene flow and shared ancestry be- 
tween Zhokhov and modern dogs and wolves. 
We found no significant gene flow between 
any sled dog (including Zhokhov) and modern 
American-Arctic wolf populations compared 
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with the Eurasian wolf (fig. S15), suggesting 
that gene flow from modern wolves has not 
contributed to the sled dog gene pool within 
the past 9500 years. This result was surprising 
given genetic evidence for postdomestication 
admixture between other wolves and dog breeds 
(6, 7). Furthermore, ethnographic evidence from 
Greenland indicates that, at least historically, 
dog-wolf matings were not uncommon (8). If 
true, then the lack of gene flow from modern 
American-Arctic wolves into sled dogs implies 
selection against hybrids. 

The clustering and admixture results show 
gene flow between some sled dogs and other 
modern dog breeds (Fig. 1C and figs. S6 to S8). 
We further explored this by comparing pairs 
of sled dogs with Zhokhov using D statistics 
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(Fig. 2A). Although pairs of Greenland sled dogs 
are symmetrically related to Zhokhov (D~0), 
indicating a lack of admixture, comparisons 
involving non-Greenland sled dogs were not 
always consistent with the null hypothesis 
of no admixture. D-statistics and admixture 
analyses (Fig. 2B and fig. S13) indicated that 
non-Greenland sled dogs carry ancestry from 
non-sled dogs and that Greenland sled dogs 
are the least admixed. These results imply that 
Greenland sled dogs have largely been kept 
isolated from contact with other dog breeds, 
and that their lineage traces more genomics 
ancestry to Zhokhov-like dogs relative to other 
dog breeds. Isolation of Greenland sled dogs 
was supported by inference of their histori- 
cal effective population size (fig. S16), which 
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Fig. 1. Geographic location of the samples and overall genetic affinities. 
(A) Identity by state pairwise distances between Zhokhov and present-day 
dogs (table S1) of geographic affiliation of dogs and archaeological sites. 
Color scale indicates genetic distance between Zhokhov and each sample. 
Circles and triangles represent modern and ancient dogs, respectively. 
Stars show Zhokhov and Yana sites. (B) Principal component analysis 
(PCA) using whole-genome data (2,200,623 transversion sites) on all 
samples. (C) TreeMix admixture graph built using whole-genome data 
(766,082 transversion sites) on a dataset consisting of 66 canids merged 
into 15 groups according to their geographic location and admixture 
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showed that these dogs had a relatively stable 
population size until a severe bottleneck 
~850 years ago. The timing of the bottleneck 
is consistent with the colonization of Green- 
land by Inuit (9), suggesting isolation in Green- 
land ever since. 

Numerous generations of sled dogs living in 
the Arctic environment and being used as 
draft animals may have provided a unique 
selection pressure to these dogs. To detect pu- 
tative signals of positive selection, we used 
population branch statistics (PBS) (10) to scan 
for genomic regions highly differentiated in 
modern sled dogs relative to non-sled dogs 
(hereafter referred to as “other dogs”) and 
wolves. We computed these statistics on mod- 
ern genomes of 17 sled dogs, 61 other dogs, 
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and fig. S6). Colors indicate main groups as in (B). Arrows 
show inferred admixture edges colored by migration weight. (D) D statistic 
1, boxer dog; Taimyr or Yana, Andean fox) testing for 
Pleistocene wolf gene flow in ancient and modern dogs and whether samples 
share more alleles with Taimyr (x-axis) or Yana (y-axis) wolves when compared 
og. Color indicates the type of sample in H1. Points show 

the D statistic, and horizontal and vertical lines show 3 SEs for the test with the 
Taimyr (x-axis) and Yana (y-axis), respectively. The results obtained from 

ves fall along the diagonal, suggesting that they are 
symmetrically related to all dogs. 
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and 30 wolves (table S1). A sliding window 
analysis revealed several genomic regions 
with high PBS values, hinting at selection 
in sled dogs (Fig. 3A). We took an outlier 
approach and focused on the most extreme 
values of the empirical distribution (above 
the 99.95th percentile). For each of these 
outlier regions (table S4), we identified over- 
lapping genes and compared haplotypes 
across samples. 

Enrichment analysis (4) on genomic regions 
with high PBS values (above the 99.95th per- 
centile) identified three gene ontology (GO) 
terms that were overrepresented (table S6): 
y-aminobutyric acid secretion (GO: 0014051, 
p = 0.119), calcium ion import (GO: 0070509, 
p = 0.119), and calcium ion transmembrane 
transport (GO: 0070588, p = 0.382). To inves- 
tigate further, we focused on eight genomic 
regions that are highly differentiated in sled 
dogs and three regions where other dogs dif- 
fer from sled dogs and wolves (Fig. 3A and fig. 
$18), and validated the autosomal regions with 
across-population composite likelihood ratio 
statistic (5) (fig. S21). In the differentiated re- 
gions, we focused on two sets of genes: those 
in which Zhokhov carries the same haplotype 
as modern sled dogs and those involved in 
adaptation to different diets. 
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TRPC4 is highly differentiated in sled dogs, 
and the putatively selected haplotype bears a 
marked similarity to Zhokhov (Fig. 3, A and B). 
TRPC4 is a transient receptor potential (TRP) 
channel protein that plays an important role 
in vasorelaxation and lung microvascular per- 
meability (21). It is also involved in a temper- 
ature sensitivity pathway (72, 13), where it 
interacts with TRPV2, which is also highly 
differentiated in sled dogs (99.8th PBS per- 
centile; table S4 and fig. S19A) and codes for 
temperature and potentially pain receptors 
(14). Several related thermo-TRP sensors in 
the same pathway, calcium ion transmembrane 
transport, have been previously reported to be 
under selection in cold-adapted woolly mam- 
moths (15), which suggests convergent evolu- 
tion in Arctic adaptation. 

Another highly differentiated gene in sled 
dogs is CACNAIA (Fig. 3, A and C), a calcium 
channel subunit that plays an essential role 
in skeletal muscle contraction (/6). Further, 
CACNAIA has been reported to be under posi- 
tive selection in humans, specifically the Bajau 
sea nomads (17), where it is involved in hy- 
poxia adaptation (78), indicating a possible 
role in managing exercise-induced hypoxia 
in sled dogs. We hypothesize that the TRPC#, 
TRPV2, and CACNAIA genes are involved in 
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functions beneficial to physical activity in the 
Arctic. If so, given that the differentiated hap- 
lotypes are also found in Zhokhov (Fig. 3, A 
and B, and fig. SI9A), any advantages that they 
confer would have been important to dogs in 
the Arctic ~9500 YBP. 

Most domestic dogs are adapted to starch- 
rich diets through marked increases in 
AMY2B copy numbers and strong positive 
selection for a dog-specific MGAM haplo- 
type (19). Consistent with previous findings 
(20), we observed that sled dogs carry sub- 
stantially fewer AMY2B copies than other 
dog breeds (fig. S20). We also found that 
MGAM and AMY2B are the regions of the 
genome with the lowest PBS, suggesting high 
differentiation of other dogs relative to sled 
dogs and wolves (Fig. 3A). Because negative 
PBS can arise under different demographic 
scenarios, we confirmed these observations 
by computing PBS with other dogs as the 
focal population (fig. S18). Indeed, modern 
sled dogs and Zhokhov are among the only 
dogs in our dataset that carry the ancestral 
MGAM haplotype found at high frequency in 
wolves (Fig. 3C and fig. S18). Therefore, our 
observations suggest that sled dogs do not 
carry the genetic adaptations to starch-rich 
diets seen in other dog breeds. 
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Fig. 2. Relationships between Zhokhov and present-day sled dogs. (A) D 
statistics testing the relationships between pairs of sled dogs and Zhokhov. 
Cell colors indicate the Z scores obtained from the test D(dogl, dog2; 
Zhokhov, Andean fox), where dogl and dog2 are all possible pairs of sled 
dogs. Comparisons involving pairs of Greenland sled dogs and non-Greenland 
sled dogs resulted in significant deviations from HO (|Z| > 3). (B) D statistics 
showing that sled dogs that are significantly further from Zhokhov compared with 
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Greenland sled dog Aasiaat 2 [y-axis: D(Greenland sled dog Aasiaat 2, H2: 
Zhokhov, Andean fox)] also show evidence of significant gene flow from other 
dogs [x-axis: D(Greenland sled dog Aasiaat 2, H2; German shepherd dog, 
Andean fox)]. Points indicate the D statistic, and horizontal and vertical 
lines indicate 3 SEs for the x-axis and y-axis, respectively. We consider the 
test to be significant for gene flow when these lines do not overlap with 

the dotted line (|Z| > 3). 


3 of 5 


RESEARCH | REPORT 


Fig. 3. Adaptation. 
(A) Manhattan plot of oe 4 
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By contrast, sled dogs harbor specific hap- 
lotypes of genes involved in coping with a high 
intake of fatty acids. SLC25A40, a mitochondrial 
carrier protein involved in clearing triglycerides 
from the blood (27), and APOO, an apolipo- 
protein gene involved in regulating high levels 
of fat and fatty acid metabolism (22), are both 
highly differentiated in sled dogs (Figs. 3A). 
The derived haplotypes of both genes are ab- 
sent in Zhokhov, indicating that the haplo- 
types are specific to modern sled dogs and 
postdate their common ancestors with Zhokhov 
(fig. S19, B and E). As another example of con- 
vergent evolution, another gene of the apolipo- 
protein family, APOB, is reported to be under 
selection in polar bears, possibly as a result of 
adaptation to fat-rich diets and clearance 
of cholesterol from the blood (23). Overall, 
similar adaptations to high intake of fatty 
acids have been described in Inuit and other 
Arctic human populations (24, 25), so our ob- 
servations suggest that sled dogs adapted to a 
fat-rich and starch-poor diet, echoing the dietary 
adaptations of the Arctic human cultures with 
whom they coexisted. 
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Bone composition of polar bears and rein- 
deer consumed at the Zhokhov site indicate an 
extensive hunting range and transport of large 
body parts back to camp (26). Further, abun- 
dant obsidian tools found at the site reveal 
movement of obsidian from ~1500 km away 
(3). Together, these findings indicate substan- 
tial long-distance travel and transportation 
of resources, in which dog sledding would 
have been highly advantageous—if not neces- 
sary. Putative sled remains and our genomic 
analyses of a 9500-year-old dog from the 
Zhokhov site indicate that the traditions and 
key genomic variations that define modern 
sled dogs were established in the northeast 
Asian Arctic >9500 years ago. Our results 
imply that the combination of these dogs with 
the innovation of sled technology facilitated 
human subsistence since the earliest Holocene 
in the Arctic. 
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CORONAVIRUS 


Structural basis for inhibition of the RNA-dependent 
RNA polymerase from SARS-CoV-2 by remdesivir 
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The pandemic of coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome 
coronavirus 2 (SARS-CoV-2), has become a global crisis. Replication of SARS-CoV-2 requires the viral 
RNA-dependent RNA polymerase (RdRp) enzyme, a target of the antiviral drug remdesivir. Here we report the 
cryo-electron microscopy structure of the SARS-CoV-2 RdRp, both in the apo form at 2.8-angstrom 
resolution and in complex with a 50-base template-primer RNA and remdesivir at 2.5-angstrom resolution. 
The complex structure reveals that the partial double-stranded RNA template is inserted into the central 
channel of the RdRp, where remdesivir is covalently incorporated into the primer strand at the first 
replicated base pair, and terminates chain elongation. Our structures provide insights into the mechanism 
of viral RNA replication and a rational template for drug design to combat the viral infection. 


he coronavirus disease 2019 (COVID-19) 

pandemic that has arisen from wide- 

spread severe acute respiratory syndrome 

coronavirus 2 (SARS-CoV-2) infection 

has become a humanitarian crisis, with 
more than 1.5 million infections and 87,000 
deaths reported on 8 April 2020 (J, 2). These 
numbers have increased rapidly to more 
than 2.99 million infections and 207,000 
deaths as of 27 April of 2020 (2). SARS-CoV-2 is 
closely related to severe acute respiratory syn- 
drome coronavirus (SARS-CoV) and several 
members of the betacoronavirus family, in- 
cluding bat and pangolin coronaviruses (3-5). 
Compared with the binding behavior of other 
coronaviruses, however, the spike protein of 
SARS-CoV-2 has a stronger binding affinity 
for the host receptor (6-10), which may explain 
why SARS-CoV-2 has a much higher incidence 
of human-to-human transmission, resulting 
in infections throughout the world. 
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SARS-CoV-2 is a positive-strand RNA vi- 
rus. Its replication is mediated by a multi- 
subunit replication-and-transcription complex 
of viral nonstructural proteins (nsps) (JJ). 
The core component of this complex is the 
catalytic subunit (nsp12) of an RNA-dependent 
RNA polymerase (RdRp) (72, 13). By itself, 
nsp12 has little activity and its functions re- 
quire accessory factors, including nsp7 and 


A nsp7(1-83) 
=a 


nsp12(1-932) 
6-hairpin 


nsp8 (14, 15), that increase RdRp template 
binding and processivity. RdRp is also pro- 
posed to be the target of a class of antiviral 
drugs that are nucleotide analogs; this cat- 
egory includes remdesivir (16-18), which 
is a prodrug that is converted to the active 
drug in the triphosphate form [remdesivir 
triphosphate (RTP)] within cells (79). As 
such, RdRp has been the subject of inten- 
sive structural biology efforts. The structures 
of nsp7, nsp8, and the complex of nsp12-nsp7- 
nsp8 have been determined (15, 20-23), pro- 
viding the overall architecture of the RdRp 
complex. However, the drug discovery effort 
is hampered because the structures of the 
SARS-CoV-2 RdRp in complex with an RNA 
template and with nucleotide inhibitors are 
not known. In this study, we determined two 
cryo-electron microscopy (cryo-EM) structures 
of the SARS-CoV-2 RdRp complex: one in the 
apo form and the other in a complex with a 
template-primer RNA and the antiviral drug 
remdesivir. 

For cryo-EM studies, we coexpressed nsp12 
with nsp7 and nsp8 to form the core RdRp 
complex in insect cells (Fig. 1A and fig. S1, 
Ato D). The stoichiometric amount of nsp7 
and nsp8 appeared to be less than that of 
nsp12, and thus additional bacterially ex- 
pressed nsp7 and nsp8 were supplemented 
before the final purification step to improve 
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Fig. 1. Assembly of an active nsp12-nsp7-nsp8 RdRp complex and its inhibition by remdesivir. 

(A) Schematic diagram for the components of the RdRp complex, containing nsp12, nsp7, and nspé. 
The polymerase motif (A to G) and the B hairpin specific to SARS-CoV-2 are highlighted. (B) Sequence of 
the RNA duplex with a 5' Uyo overhang as a template for primer extension and RdRp-RNA complex 
assembly. (©) The recombinant RdRp complex shows polymerase activity in vitro. The primer strand is 
labeled with fluorescence at the 5’ end. (D) Elongation of the partial RNA duplex by the purified RdRp 
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the yield of heterotrimeric complex. The pu- 
rified nsp12 alone showed little activity in 
binding to a 50-base partial double-stranded 
template-primer RNA (Fig. 1B and fig. SIE), 
which is similar to the SARS-CoV nsp12 (74). 
The presence of nsp7 and nsp8 markedly in- 
creased nsp12 binding to the template-primer 
RNA (fig. SIE). The nsp12-nsp7-nsp8 complex 
also showed RNA polymerization activity 
on a poly-U template upon addition of adeno- 
sine triphosphate (ATP) (Fig. 1, B and C). This 
RNA polymerization activity was effectively 
inhibited by the addition of RTP (Fig. 1D). 
Even in the presence of 10 mM ATP, which is 
within the range of physiological concentra- 
tions of ATP, 1mM RTP completely inhibited 
RdRp polymerization activity. In addition, 
100 uM RTP completely blocked the full ex- 
tension but allowed partial extension of the 
primer RNA (Fig. 1D), consistent with a de- 
layed chain termination mechanism (24). 
However, this mechanism is dependent on 
low RTP concentrations and low RTP/ATP 
ratios. By contrast, 5 mM remdesivir (as a pro- 
drug) had no inhibitory effect on the polym- 
erization activity of the purified enzyme (fig. 
SIF), nor did remdesivir in its monophosphate 
form (RMP) (fig. S1G). 

The purified RdRp complex is relatively 
thermostable, with a melting temperature 
of 53°C (fig. SIH). Negative-stain EM visual- 
ization of the nsp12-nsp7-nsp8 complex dis- 
played monodispersed particles with excellent 
homogeneity (fig. S1I). For the apo nsp12- 
nsp7-nsp8 complex, we vitrified the sample in 
the presence of the detergent DDM. The ini- 
tial attempt at image processing revealed that 
the particles are preferentially oriented (fig. 
S2A). Therefore, we collected >7400 micro- 
graph movies of more than 5.7 million par- 
ticle projections to increase the number of 
projections from the nonpreferential orien- 
tation. Of these, 81,494 particles were used to 
yield a density map of 2.8-A resolution (fig. S2, 
B and E). Cryo-EM studies of the nsp12-nsp7- 
nsp8 complex bound with the template-primer 
RNA and RTP (termed the template-RTP RdRp 
complex) faced two challenges (fig. S3). First, 
most particles were adsorbed to cryo-EM grid 
bars rather than staying in the vitreous ice. 
Second, the RNA duplex was dissociated from 
the template-RTP RdRp complex, likely owing 
to conditions of cryo-EM specimen prepara- 
tion. Eventually, we prepared the cryo-EM 
specimen of the template-RTP RdRp complex 
at 15 mg/ml, which is much higher than the 
normal concentrations used for EM studies 
of soluble protein complexes. The high con- 
centration of the complex has a mass action 
effect to stabilize the RNA-protein complex 
and has an excess amount of the complex to 
escape the absorption of cryo-EM grid bars to 
enter the vitreous ice (fig. $3). We collected 
2886 micrograph movies, which yielded a 
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Fig. 2. Cryo-EM structure of the apo nsp12-nsp-7-nsp8 RdRp complex. (A and B) Two views of the 
cryo-EM map (A) and structure (B) of the apo nsp12-nsp7-nsp8 complex. The color scheme is according to 
Fig. 1A and is used throughout the figures. (©) The conserved zinc binding motifs are highlighted in the 
apo structure rendered in ribbon. The coordinate details of the zinc-binding residues are shown in stick 
representation, with the EM map in gray surface representation. 


2.5-A resolution structure using 130,386 par- 
ticle projections. Because of the relatively high 
resolution of our structure, the EM map is 
clear for key structural features across the 
complex (fig. S4, A to F). 

The structure of the apo RdRp complex 
contains one nsp12, one nsp7, and two nsp8& 
proteins, with an overall arrangement resem- 
bling those seen in SARS-CoV and the re- 
cently solved structure of SARS-CoV-2 (15, 23) 
(Fig. 2, A and B). Our structure, which dif- 
fers from the SARS-CoV RdRp structure but 
is similar to the recent SARS-CoV-2 RdRp 
structure, reveals that nsp12 also contains 
an N-terminal f hairpin (residues 31 to 50) 


and an extended nidovirus RdRp-associated 
nucleotidyl-transferase domain (NiRAN; re- 
sidues 115 to 250) (24) with seven helices 
and three f strands (15, 23). After the NiRAN 
domain is an interface domain (residues 251 
to 365), composed of three helices and five B 
strands, that is connected to the RdRp do- 
main (residues 366 to 920) (Figs. 1A and 
2B). The nsp12 RdRp domain displays the 
canonical cupped right-handed configura- 
tion (25), with the finger subdomain (residues 
397 to 581 and 621 to 679) forming a closed 
circle with the thumb subdomain (residues 
819 to 920) (Fig. 2, A and B). The closed con- 
formation is stabilized by the binding of 
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nsp7 and nsp8, with one nsp8 molecule sit- 
ting atop the finger subdomain and inter- 
acting with the interface domain. The closed 
conformation of nsp12 is further stabilized by 
the nsp7-nsp8 heterodimer, which is packed 


against the thumb-finger interface (Fig. 2, 
A and B). In addition, we were able to assign 
two zinc ions in the conserved metal binding 
motifs composed of H295-C301-C306-C310 and 
C487-H642-C645-C646 (Fig. 2C), which are 
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Fig. 3. Cryo-EM structure of the remdesivir- and RNA-bound RdRp complex. (A and B) Two views of 
the cryo-EM map (A) and structure (B) of nsp12-nsp7-nsp8 in complex with template-primer RNA and 
remdesivir. (€) Surface view of the RdRp active site with the electrostatic potential from red (negative) to 
blue (positive). For clarity, residues 410 to 442 and 834 to 919 of nsp12 and nsp8 are excluded from 

the figure. The covalently bound remdesivir in the monophosphate form and the product, pyrophosphate, 
are shown. The active site is emphasized with a yellow dashed circle. The template groove, the entrance 
for nucleotide triphosphate (NTP), and the elongation direction are annotated with different-colored arrows. 
(D) Close-up view of the RdRp active site, showing the covalently bound RMP, pyrophosphate, and 
magnesium ions. Key residues and bases that interact with remdesivir are shown. (E and F) Superposition 
of the conserved RdRp motifs (A to G) of the RNA-bound complex with the apo structure (colored in gray), 
with a close-up view at the active site (E) and at the exit of the template and primer strand (F). 
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also observed in the SARS-CoV RdRp struc- 
ture (15). These zinc ions likely function as 
conserved structural components in maintain- 
ing the integrity of the RdRp architecture. 

The structure of the template-RTP RdRp 
complex contains one nsp12, one nsp7, and 
one nsp8 (Fig. 3, A and B). The second nsp8 was 
largely invisible in the EM map of the template- 
RTP complex (fig. S4C); therefore, it was not 
included in the final model. In addition, the 
template-RTP RdRp structure contains 14-base 
RNA in the template strand, 11-base RNA in the 
primer strand, and the inhibitor RMP (Fig. 3, 
C and D), which is covalently linked to the 
primer strand, as well as a pyrophosphate and 
two magnesium ions that may serve as cata- 
lytic ions near the active site (Fig. 3D and fig. 
S4, D to F) (26). 

The overall structure of the template-RTP 
RdRp complex is similar to the apo RdRp 
structure, with nsp12 in a closed conforma- 
tion (Figs. 2A and 3A). The double-stranded 
RNA helix, formed by 11 base pairs from the 
template-primer RNA (Figs. 3C and 4, A to E), 
is held by the finger-palm-thumb subdomains. 
Extensive protein-RNA interactions are ob- 
served between the template-primer RNA and 
nsp12, with a total of 41 residues from nsp12 
directly participating in RNA binding (within 
4.0 A of RNA, 26 residues to the template 
strand and 15 residues to the primer strand; 
Fig. 4E). Surprisingly, no RNA interactions 
are mediated by nsp7 or nsp8, although these 
two proteins are required for RNA binding by 
RdRp. Most protein-RNA interactions involve 
the RNA phosphate-ribose backbones, with 
many interactions directly to 2'-OH groups 
(Fig. 4E), thus providing a basis to distinguish 
RNA from DNA. There are no contacts from 
nsp12 to any base pairs of the template-primer 
RNA, suggesting a sequence-independent bind- 
ing of RNA by RdRp. This is consistent with 
the fact that no specific sequence is required 
for the enzymatic activity of RdRp at the elon- 
gation step. 

At the 3’ end of the primer strand is RMP 
(Figs. 3D and 4, D and E, and fig. S4, E and F), 
which is covalently incorporated into the primer 
strand at the +1 position (Fig. 4E). Additional 
nucleotides at the +2 and +3 positions of the 
template strand interact with residues from 
the back of finger subdomain (Fig. 4, A and B). 
Despite the presence of excess RTP in complex 
assembly, only a single RMP is assembled into 
the primer strand, as observed in the struc- 
ture. Consistent with the data from Fig. 1D, the 
primer extension is immediately terminated 
when RTP concentration is high and ATP/RTP 
ratio is low. Thus, remdesivir, like many nu- 
cleotide analog prodrugs, inhibits viral RdRp 
activity through nonobligate RNA chain termi- 
nation, a mechanism that requires conversion 
of the parent drug to the active triphosphate 
form (27, 28). 
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The RMP position is at the center of the 
catalytic active site (Fig. 3D). As an adenosine 
monophosphate analog, RMP forms base- 
stacking interactions with an upstream base 
from the primer strand and two hydrogen 
bonds with the uridine base from the tem- 
plate strand (Fig. 3D and fig. $5). In addition, 
RMP forms interactions with side chains from 
K545 and R555. Near the bound RMP are two 
magnesium ions and a pyrophosphate. Both 
magnesium ions interact with the phosphate 
diester backbone and are part of the catalytic 
active site. The pyrophosphate is at the gate 
of the nucleotide entry channel to the active 
site and may block the entry of nucleotide 
triphosphate to the active side (Fig. 3, C 
and D). 

The catalytic active site of the nsp12 RdRp 
consists of seven conserved motifs (A to G; 
Figs. 1A and 3E and fig. S6). Motifs A, B, C, and 
D are from the palm subdomain, with an SDD 
sequence (residues 759 to 761) in motif C form- 
ing the catalytic active center (Fig. 3D). Both 
D760 and D761 are involved in coordination 
of the two magnesium ions at the catalytic 
center. Motifs F and G are located within the 
finger subdomain; they interact with the tem- 
plate strand RNA and direct this strand into 
the active site (Fig. 3E). Motif F also interacts 
with the primer strand RNA, with the side 
chains of K545 and R555 contacting the +1 base 
(Fig. 3D) and thus stabilizing the incoming 
nucleotide in the correct position for catalysis. 
The orientation of template-primer RNA in 
the active site is similar to the orientation of 
template-primer RNA in the poliovirus RdRp 
elongation complex (29) and the hepatitis C 
virus NS5B RdRp inhibitor complex (30) (fig. 
87). The residues involved in RNA binding 
and those that constitute the catalytic active 
site are highly conserved (31, 32), highlight- 
ing the conserved mechanism of genome 
replication by RdRp in these diverse RNA 
viruses and suggesting that it may be pos- 
sible to develop broad-spectrum antiviral in- 
hibitors such as remdesivir (18) and galidesivir 
(BCX4430) (33). 

Structural comparison reveals several nota- 
ble differences between the apo and complex 
structures (Fig. 3, E and F, and fig. S8, A and 
B). First, nsp7 moves toward the RdRp core by 
1.5 A (as measured by nsp7 residue F49; fig. S8, 
A and B), leading to a rearrangement of the 
interface, which results in weaker association 
of the second nsp8 in the complex. Second, the 
loop that connects the first and second helices 
of the thumb subdomain moves outward by 
2.8 A (as measured by nsp12 residue 1847) 
to accommodate the binding of the double- 
stranded RNA helix (Fig. 3F). Third, motif G 
residues K500 and S501 also move outward 
by 2.0 Ato accommodate the binding of the 
template-strand RNA. Outside of these changes, 
the apo nsp12 and the RNA complex nsp12 are 


Yin et al., Science 368, 1499-1504 (2020) 


26 June 2020 


Primer : 
Template 


GH 2 7 
Thumb Wa.) 
F920 yig24e7.1864 | 
\| iin Pr C 
WwW ( \ | sh a > a 


VIZAN 1h in) 


Template 


Primer 


Huet 2 o 
Hae a 8 


S 1 
S. 1 
Ls, ifs i 
1 SS, 

. ieee ieee ae | 


i 
‘ 


i 
i 


iy 
rN. 
a: 


Pf 
a 
. 


Fig. 4. RNA recognition by the RdRp complex. (A to D) Protein-RNA interactions in the RNA- and remdesivir- 
bound RdRp complex. (E) Diagram of detailed RNA interactions with RdRp. Single-letter abbreviations for 

the amino acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, 
Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 


very similar, with a root mean square devia- 
tion of 0.52 A for all Co atoms across the whole 
protein. In particular, the structural elements 
that make up the catalytic active site can be 
precisely superimposed (Fig. 3E), which sug- 
gests that the SARS-CoV-2 RdRp is a relatively 
stable enzyme that is ready to function asa 
replicase upon RNA template binding. Viral 
RdRp is a highly processive enzyme with a rep- 
lication rate of up to 100 nucleotides per second 
(34). No substantial conformational changes 
between the apo and active enzyme structures 
are consistent with the high processivity of the 


viral RNA polymerase, which does not need 
to consume additional energy for conforma- 
tional changes in the active site during the 
replication cycle. 

Besides remdesivir, several nucleotide ana- 
log drugs—including favipiravir, ribavirin, 
galidesivir, and EIDD-2801—efficiently inhibit 
SARS-CoV-2 replication in cell-based assays 
(35, 36). Like remdesivir, these nucleotide ana- 
logs are proposed to inhibit the viral RdRp 
through nonobligate RNA chain termination, 
a mechanism that requires conversion of the 
parent compound to the triphosphate active 
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form (33). The structure of the template-RTP 
RdRp complex provides a useful model to 
rationalize how these drugs inhibit the ac- 
tivity of SARS-CoV-2 RdRp (fig. S8C). In par- 
ticular, EIDD-2801 has been shown to be 3 to 
10 times as potent as remdesivir in blocking 
SARS-CoV-2 replication (36). The N4 hydroxyl 
group off the cytidine ring forms an extra 
hydrogen bond with the side chain of K545, 
and the cytidine base also forms an extra 
hydrogen bond with the guanine base from 
the template strand. These two extra hydro- 
gen bonds may explain the apparent higher 
potency of EIDD-2801 in inhibiting SARS- 
CoV-2 replication. 

The COVID-19 pandemic has inflicted emo- 
tional pain and economic burden across the 
globe. Enzymes that are vital for the viral life 
cycle are suitable antiviral drug targets be- 
cause they differ from host proteins. Among 
viral enzymes, RdRp is the primary target 
of many existing nucleotide drugs. In this 
paper, we report the structures of the SARS- 
CoV-2 RdRp complex in the apo form as well 
as in complex with a template-primer RNA 
and the active form of remdesivir. The struc- 
tures reveal how the template-primer RNA is 
recognized by the enzyme and how chain 
elongation is inhibited by remdesivir. Struc- 
ture comparison and sequence alignment 
suggest that the mode of substrate RNA rec- 
ognition and remdesivir inhibition of RdRp 
is highly conserved in diverse RNA viruses, 
providing a foundation for designing broad- 
spectrum antiviral drugs based on nucleo- 
tide analogs. Moreover, our structures provide 
a solid template for modeling and modifying 
the existing nucleotide drugs, including the 
highly potent EIDD-2801. Together, these ob- 
servations provide a rational basis to design 
even more potent inhibitors to combat SARS- 
CoV-2 infection. 
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The pressure to assimilate 


hy are you dressed so nicely?” a fellow graduate student asked me in passing, after noticing 
my collared shirt, slacks, and dress shoes. “Oh, I have to teach today,’ I replied. He stopped and 
stared at me for a few moments, the confusion written plainly on his face. “As a Black man, 
students treat me with more respect when I dress up,” I explained. What I did not say was, “Our 
society’s current idea of professionalism is so intertwined with straight, white, masculinity that 
underrepresented people must go above and beyond or risk being seen as incompetent.” 


I love teaching. However, for a Black 
man in a STEM department, it can 
be taxing, frustrating, and daunting 
to figure out how to present myself 
to undergraduate students. Taxing 
because I have to learn to interact 
with students who may have pre- 
conceived notions of my abilities. 
Frustrating because the number of 
colleagues who share my experi- 
ences or can offer me guidance is 
essentially zero. Daunting because 
for some of these students, their ex- 
perience with me will likely shape 
how they view all people of color. 

During my first year as a teach- 
ing assistant (TA), I noticed some 
students didn’t fully trust or respect 
me as an instructor. For example, 
one of my students emailed another 
TA to ask the very same question I’d 
answered for them earlier in the day. 
Another incident that got under my skin was when a stu- 
dent said I looked like rapper Wiz Khalifa, then proceeded 
to touch my dreadlocks. I felt embarrassed and violated, but 
I pretended their comment and action didn’t bother me. I 
feared that correcting the student would only present me as 
an “angry Black man.” 

To garner more respect and confidence from my students, 
I started to diligently craft a mask that I thought would 
signal my professionalism. I wore nice clothes on teaching 
days and I reduced my use of Ebonics, an English dialect 
spoken by some Black Americans that I often used grow- 
ing up. This mask seemed to work. I noticed my students 
were more engaged, asking questions and seeking me out 
for help. But the effort of maintaining my facade ultimately 
became unsustainable. Eventually, my mask cracked. 

I was waiting in my office for a student, another Black 
man, to arrive to discuss his lab report. As I responded to 
emails, a rap album by Kendrick Lamar was playing. I was 
startled when the student exclaimed, “Yo! Trai, I didn’t know 
you got down with Kendrick.” This catalyzed an animated 
discussion about rappers, growing up in big cities, and 


“The mask | crafted was 
disingenuous and was alienating 
students of color.” 


people of color in STEM. My guard 
dropped, the Ebonics slipped out, and 
for a short period I was simply me. 
As I shifted our conversation to his 
assignment, he said offhandedly, “I 
wish you were more like this in class.” 
He wasn’t ill-intentioned, but his 
comment did prompt introspection. 

I realized that although my 
“professional” mask appealed to 
my white students, it had unfore- 
seen consequences. The mask I 
crafted was disingenuous and was 
alienating students of color while 
reinforcing the misconception that 
the only way to succeed is through 
cultural assimilation. 

After this revelation, I discarded 
my mask. I continued to dress nicely 
because that’s something I enjoy. But 
I started to speak Ebonics more. I 
also decided to take time, at the be- 
ginning of every semester, to tell my students about my edu- 
cational and cultural background. This serves a dual purpose: 
It outlines my teaching qualifications and demonstrates, to 
underrepresented students especially, my pride in my culture 
and upbringing. 

Admittedly, I’m not sure whether this is the best approach, 
and I do feel disheartened that I have to make a special effort 
to build credibility with my students. But I have noticed it has 
paid dividends. All my students have remained engaged, my 
underrepresented students seem empowered, and I no longer 
feel the need to put on an insincere mask. 

I encourage my STEM colleagues to think seriously about 
who is more professional. Is it the man who wears button- 
down shirts or the person who is knowledgeable in their field? 
Is it the woman with straight hair or the person who leads 
productive discussions? A true professional doesn’t have a 
prescribed appearance. I’m a young Black man with tattoos 
and dreadlocks who is fluent in Ebonics. I’m also a behavioral 
ecologist, activist, and educator. I am a professional. 


Montrai Spikes is a Ph.D. student at the University of Oklahoma. 
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